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(57) Abstract: The present invention relates to a DNA molecule or vector and a host cell contain- 
ing this DNA molecule or vector which can be used to produce a heterologous polypeptide under 
conditions that elicit a cold shock response in the host cell. The DNA molecule and vector include 
a nucleotide sequence encoding a heterologous polypeptide and a promoter and 5'-UTR from a 
cold shock inducible gene which directs its expression. In addition, an AT-rich sequence that en- 
hances translation under cold shock inducible conditions is either present in the coding sequence 
of the heterologous polypeptide or in an additional element inserted between the coding sequence 
and the cold shock inducible promoter and 5'-UTR. 



BEST AVAILABLE COPY 



SDOCID: <WO_ 



_03074657A2J_> 



WO 03/074657 PCT/US03/05531 

- 1 - 



COU3 SHOCK INDUCIBLE EXPRESSION AND PRODUCTION OF HETEROLOGOUS 

POLYPEPTIDES 



BACKGROUND OF THE INVENTION 
Field of the Invention 

[0001] The present invention relates to a DNA molecule, 
vector, host and method for cold shock inducible expression and 
production of heterologous polypeptides. 

Description of the Related Art 

[0002] When Escherichia coli cells grown at 37 °C are 
transferred to a low temperature such as 15°C, a set of proteins 
called cold-shock proteins is transiently induced at a very high 
level during a growth lag period called the acclimation phase 
(Jones et al. f 1987). These cold-shock specific proteins 
include: CspA, CspB, CspG, Cspl, CsdA, RbfA, NusA, and PNP (for 
review see Phadtare et al . , 1999, 2000 and 2002; Yamanaka et 
al., 1998). Among them, CspA has been identified as a major 
cold-shock protein consisting of 70 amino acid residues. 

[0003] The CspA family of E. coli consists of nine 
homologous proteins, CspA to Cspl, but among them only CspA, 
CspB, CspG and Cspl are cold-shock inducible. Interestingly, the 
cspA gene is dispensable at both normal and low growth 
temperatures (Bae et al . , 1997). None of the CspA homologues 
appears to be singularly responsible for cold-shock adaptation, 
as members of the CspA family functionally overlap each other 
during cold-acclimation of cells (Xia et al . , 2001). Indeed, a 
kcspAbcspB&cspG triple deletion strain is still viable, while a 
LcspAhcspBbcspGkcspE quadruple deletion strain is unable to form 
colonies at low temperature. Interestingly, any single gene from 
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the nine CspA homologues except CspD has been shown to be 
capable of complementing the cold-sensitivity of the quadruple 
deletion strain (Xia et al . , 2001). It has been shown that CspA 
is differentially regulated from CspB, CspG and Cspl (Etchegaray 
et al., 1996 and Wang et al . f 1999). High levels of CspA 
production are seen between 24 °C and 10°C / while CspB and CspG 
are produced only after temperature shifts to below 20°C, the 
maximum induction being at 15 °C. Cspl is induced between 10- 
15°C. It has also been shown that CspA, CspB and CspG are 
induced at low temperature under conditions that completely 
block protein synthesis (Etchegaray et al . , 1999). 

[0004] The cspA expression is regulated in a complex manner, 
that is at levels of transcription, mRNA stability and 
translation efficiency (for review see Phadtare et al., 2000 and 
Yamanaka et al . , 1998). The cspA gene has an unusually long 5' 
untranslated region (5'-UTR) consisting of 159 bases. Deletion 
analysis of the cspA 5 1 -UTR showed that this region is 
responsible for its extreme instability at 37 °C (half-life less 
than 12 sec) , and has positive effect on mRNA stabilization at 
low temperature (Jiang et al . , 1996 and Mitta et al . , 1997). 
The cspA mRNA is dramatically stabilized (half-life more than 20 
min) immediately following cold shock. This stabilization is 
transient and is lost once cells are adapted to low temperature. 
This in turn regulates the expression of cspA. 

[0005] The cold-shock induction of cspA is quite different 
from the heat-shock induction, as the cspA induction does not 
require a specific transcription factor. Interestingly, the 
cspA promoter is active at 37 °C, but CspA is greatly reduced as 
the cspA mRNA is extremely unstable at this temperature. Thus, 
the 5' -UTR of the cspA mRNA plays a crucial role in its cold- 
shock inducibility . Recently, it has been shown that CspA is 
produced at 37 °C during early exponential growth phase and its 



BNSDOCID: <WO 03074657A2_I_> 



WO 03/074657 PCT/US03/05531 

- 3 - 



mRNA becomes unstable by mid- to late-exponential growth phase 
(Yamanaka et al . , 2001 and Brandi et al . , 1999). The 5'-UTR 
region of the cspA mRNA contains a "Cold Box" sequence conserved 
among several cold-shock mRNAs . This region forms a stable 
stem-loop structure followed by an AU-rich sequence (Fang et 
al., 1998). The laboratory of the present inventors showed that 
this region is essential for the normal cspA mRNA induction 
after cold shock, as a deletion of the stem-loop significantly 
destabilizes the mRNA and reduces the cold shock-induced cspA 
mRNA amount by approximately 50%. The AU-rich track, however, 
slightly destabilizes the mRNA. The integrity of the stem is 
essential for the stabilizing function, while that of the loop 
sequence is less important (Xia et al., 2001). 

[0006] Overexpression of a mutant cspA mRNA devoid of both 
' the initiation codon (AUG) and the coding sequence results in a 
severe inhibition of growth at low temperature along with a 
derepression of the chromosomal cspA expression. Further, the 
overexpressed RNA is stably associated with the 30S and 70S 
ribosomes. Results from the laboratory of the present inventors 
demonstrated that the 5'-UTR by itself had a remarkable affinity 
to ribosomes at low temperature. Overproduction of the 5 ' -UTR at 
15°C results in delayed induction of the cold-shock response and 
in the prolonged synthesis of not only CspA, but also CspB and 
CspG (Fang et al., 1998 and Jiang et al . , 1996). These effects 
are repressed by coproduction of the 5' -UTR together with CspA. 
The AT-rich sequence immediately upstream of the -35 region of 
the cspA promoter has been shown to function as an UP element to 
enhance cspA transcription. Deletion of the UP element resulted 
in diminished activity of the cspA promoter (Goldenberg et al., 
1997 and Mitta et al . , 1997). Another important factor that 
contributes towards higher promoter activity is the presence of 
a TGn motif immediately upstream of the -10 region (Kumar et 
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al., 1993). It is reported that this motif together with the -10 
region constitutes the extended -10 region and the -35 region is 
dispensable in the presence of this region. 

[0007] Importantly, the expression of cspA is also regulated 
at the level of translation. The preferential synthesis of 
cold-shock proteins during the growth lag period (the 
acclimation phase) suggests that their mRNAs, unlike most other 
cellular (non-cold-shock) mRNAs, possess a mechanism to form the 
translation initiation complex at low temperature without the 
cold-shock ribosome factors, such as RbfA and CsdA. The recent 
data from the laboratory of the present inventors show that 
there are elements within the coding sequence of CspA that 
enhance its translation at low temperature. mRNAs for cold- 
shock proteins such as CspA, CspB, CspG, Cspl, CsdA and RbfA 
have been proposed to have an element called the downstream box 
(DB) in the coding region which enhances translation initiation 
(Mitta et al . , 1997). Originally, the DB sequence was proposed 
to be complementary to a region in the penultimate stem of 16S 
rRNA and is located a few bases downstream of the initiation 
codon. It has been debated how DB enhances translation 
initiation (Etchegaray et al . , 1999 and O'Connor et al . , 1999) 
and the originally proposed mechanism by facilitating the 
formation of translation preinitiation complex through binding 
to 16S rRNA may not be the precise mechanism. 

[0008] The phenomenon termed as 1 LACE ' effect (low- 
temperature antibiotic effect of truncated cspA expression) was 
observed in E. coll (Jiang et al . , 1996 and Xia et al . , 2001). 
When a truncated cspA gene is overexpressed at low temperature, 
cell growth is completely blocked. This has been demonstrated 
to be caused due to the entrapment of almost all the cellular 
ribosomes by the truncated cspA mRNA. This truncated mRNA is 
still able to form the preinitiation complex with non-adapted 
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ribosomes at low temperature. Unambiguous demonstration of 
ribosome entrapping by truncated cspA mRNA has been carried out 
by incorporating a terminator codon either at the second 
(pAOlS) , or the eleventh (pAlOS) or the 31st (pA30S) codon in 
the cspA gene cloned in a pUC vector (Xia et al . , 2001). At 
37°C, cells carrying these plasmids are perfectly normal, while 
upon cold-shock cells stop to grow completely. They are unable 
to form colonies at 15°C and with 35 S-Met, no protein was 
labeled. When polysome profiles of these cells were analyzed, 
cells expressing only the initiation codon (pAOlS) contained 
only monosomes without any polysome peaks. Cells with pAlOS 
showed di- and monosomes and cells with pA30S showed tri-, di- 
and monosomes again without any large polysomal peaks. 
Furthermore, a major cellular mRNA (Ipp) was shown to be 
excluded from polysomes by taken over being the truncated cspA 
mRNA. These results clearly demonstrate that the robust 
translatability of the cspA mRNA is determined at the step of 
initiation. In this study, the laboratory of the present 
inventors also showed that the upstream region within the 5'-UTR 
of the cspA mRNA plays an important role in the formation of the 
translation initiation complex leading to the LACE effect (Xia 
et al. , 2001) . 

[0009] CspA and its homologues are proposed to be RNA 
chaperones by destabilizing secondary structures in mRNAs (Bae 
et al., 2000; Jiang et al . , 1997 and Phadtare et al . , 2001). 
Since the AcspA&cspB&cspGAcspE quadruple deletion strain can not 
grow at low temperature while a single csp gene is able to 
complement the cold-sensitivity (Xia et al . , 2001), the RNA 
chaperone function is considered to be crucial for efficient 
translation of cellular mRNAs at low temperatures by blocking 
stable secondary-structure formation, which is inhibitory to the 
mRNA translation (Phadtare et al . , 2001). 
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[0010] Cold-inducible vectors containing the cspA promoter 
have been shown to be useful for expression of aggregation-prone 
proteins such as, preS2-S ' -(3-galactosidase and TolAI-(3-lactamase 
(Mujacic et al . , 1999; Vasina et al . , 1997 and Vasina et al . , 
1996). U.S. Patent no. 6,333,191 Bl discloses promoters of cspA 
and cspB and vectors carrying such promoters. 

[0011] Citation of any document herein is not intended as an 
admission that such document is pertinent prior art, or 
considered material to the patentability of any claim of the 
present application. Any statement as to content or a date of 
any document is based on the information available to applicant 
at the time of filing and does not constitute an admission. as to 
the correctness of such a statement. 

SUMMARY OF THE INVENTION 

[0012] The present invention provides a DNA molecule and 
vector capable of expressing a heterologous polypeptide in a 
host cell under conditions that elicit a cold shock response in 
the host cell. The DNA molecule according to the present 
invention contains a nucleotide sequence encoding a heterologous 
polypeptide operably linked to a cold shock inducible gene 
promoter and 5 1 -untranslated region <5'-UTR) sequence which 
directs the expression and production of a heterologous 
polypeptide in a host cell under conditions that elicit a cold 
shock response. In addition, either the nucleotide sequence 
encoding the heterologous polypeptide has an AT rich sequence 
(naturally-occurring or created by, i.e., site-directed 
mutagenesis) at about the 4 th to 7 th codons downstream from the 
initiation codon or a translation enhancing element is inserted 
between the nucleotide sequence encoding the heterologous 
polypeptide and the cold shock inducible promoter and 5 1 -UTR 
sequence. This translation enhancing element contains an 
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initiation codon and an AT rich sequence at about the 4 th to 7 th 
codons downstream from the initiation codon to form a 
translational in-frame fusion with the nucleotide sequence 
encoding the heterologous polypeptide. 

[0013] The present invention also provides a host cell 
transformed with the DNA molecule or vector according to the 
present invention and a method of using such a host cell for 
producing a heterologous protein under cold shock inducible 
conditions, such as for whole cell or cell lysate NMR 
spectroscopy . 

BRIEF DESCRIPTION OF THK DRAWINGS 

[0014] Figure 1 shows a schematic representation of the 
structure of the prototype cold-shock vector pCold. 

[0015] Figures 2A and 2B are gels showing expression of C. 
elegans proteins at 37°C (Fig. 2A) and 15°C (Fig. 2B) . In Fig. 
2A, protein expression with (+) and without (-) IPTG in each 
case is shown. In Figure 2B, expression of C. elegans proteins 
at 15°C using cold shock vector is shown at 0 f 12, and 24 h in 
each case. 

[0016] Figure 3 are gels showing three time profile of 
production of C. elegans proteins with cold shock vector. 0, 
24, 48, 72, 96 and 120 h expression of WR49, WR35, and WR26 at 
15°C is shown. The total cell lysates were analyzed. The gels 
were stained with Coomassie Blue dye. 

[0017] Figures 4A and 4B are SDS-PAGE analyses of the 
expression of y-interferon using the cold shock vector. 
Coomassie staining of SDS-PAGE is shown in Fig. 4A Lane 1: 
before IPTG at 15°C; Lanes 2-5: 24, 48, 72 and 96 h expression, 
respectively, at 15°C; lane 6: expression at 37°C. SDS-PAGE 
analysis of pulse-labeled cells is shown in Fig. 4B. Lane 1: 
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expression at 37°C; lanes 2-4: lh, 3h and 5h expression, 
respectively, at 15°C. 

[0018] Figure 5 is a graph showing correlation between the 
A/T contents of the N 12 -downstream sequences and p-galactosidase 
activities. For each of the 57 randomly selected clones, the 
number of A plus T was counted and plotted against the 
corresponding p-galactosidase activity. 

[0019] Figure 6 shows a graph of the relationship between 
protein levels and p-galactosidase activities of the selected 
clones. The cells containing respective plasmids were grown at 
37°C to the OD 600 nm of 0.5 and were induced with 1 mM IPTG. The 
protein expression was analyzed by SDS-PAGE and band intensities 
were measured by densitometric analysis. 

[0020] Figure 7 is a combined bar graph and gel showing |3- 
galactosidase activities, protein levels and lacZ mRNA levels of 
selected clones. 

[0021] Figure 8 are graphs and gels of the distribution of 
lacZ and ompA mRNA in polysome profiles of the clones showing 
highest and lowest p-galactosidase activities. Respective N 12 
sequences (SEQ ID NO: 14 and SEQ ID NO: 26) are shown. 

[0022] Figure 9 shows a schematic representation of the 
structure of the cspA mRNA 5' -UTR. Every 10 th base is dotted. 
The upstream box (UB) is boxed, the SD sequence is in bold, the 
initiating codon is underlined and the cspA ORF is indicated in 
bold and is boxed. The 5 » -UTR sequence upstream of the cspA ORF 
corresponds to SEQ ID NO: 48 and 3 ' -UTR sequence downstream of 
the cspA ORF corresponds to SEQ ID NO: 49. 

[0023] Figure 10A and 10B show Epsilon-like sequences 
upstream of thfe SD sequence in the 5 1 -UTR cspA mRNA. Fig. 10A 
is a mutational analysis of the Epsilon-like sequence, where the 
wild-type sequence is shown as Wt (SEQ ID NO:28). The Epsilon 
sequence is shown in bold. Fig. 10B is a mutational analysis of 
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the SD sequence and the distance between the Epsilon sequence 
and the AUG codon. The Epsilon-like sequences 1-16 are SEQ ID 
NOs : 2 9-4 4 , respectively, 

[0024] Figures 11A-11C show schematic representations of the 
structures of the pCold I, II and III vectors. 

[0025] Figures 12A-12C show nucleotide sequences, SEQ ID 
NO:45 (Fig. 12A) , SEQ ID NO:46 (Fig. 12B) , and SEQ ID NO:47 
(Fig. 12C) , which are inserted into the pUC19 vector cleaved 
with Hindlll and EcoRI and filled in with Klenow. 

[0026] Figure 13 is a sequence alignment of the promoter, 5 f - 
UTR, and the first 13 codon nucleotides of cspA, cspB, cspG and 
cspl . Nucleotides identical to cspl are shown as dots. To 
maximize the alignment, some gaps have been introduced and these 
are indicated by dashes. The transcription start sites are in 
bold letters and are marked as +1. The translation start codon 
ATGs are also in bold letters and are underlined. The most 
homologous sequences (UP element, -35 region, -10 region, cold 
box, upstream sequence, Shine-Dalgarno (SD) sequence, and 
downstream box) are boxed and indicated. 

[0027] Figures 14A and 14B show a SDS-PAGE gel stained with 
Coomassie Brilliant Blue of the supernatant (500 ml culture of 
E. coll BL21 harboring pColdl-EnvZ ATP binding domain incubated 
for 48 hrs. cold shock) after ultra centrif ugation stained with 
Coomassie Brillant Blue (Fig. 14A) and a SDS-PAGE gel of total 
cell lysate, pellet and supernatant following cold shock (Fig. 
14B) of E. coll BL21. EnvZ ATP-binding domain is also known as 
fragment B. In Fig. 14B, lane 1: 50 yil total cell lysate loaded 
after cold shock 12 hrs; lane 2: 50 \il total cell lysate loaded 
after cold shock 24 hrs; lane 3: 50 ]il total cell lysate loaded 
after cold shock 36 hrs; lane 4: 50 pi total cell lysate loaded 
after cold shock 48 hrs; lane 5: 2 pi of pellet (total 1 ml) 
loaded after 15,000 rpm, 30 min. centrif ugation during second 
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round of sonication; lane 6: 2 pi of pellet (total 1 ml) loaded 
after 45,000 rpm, 4 hr ultracentrif ugation during second round 
of sonication; lane 7: 1 \il supernatant loaded after 45,000 rpm, 
4 hr. ultra-centrif ugation during second round of sonication; 
lane 8: 1 pi supernatant (sample for NMR) loaded after 45,000 
rpm, 4 hr ultracentrif ugation during the first round of 
sonication . 

[0028] Figures 15A and 15B show the ( 1 H- 15 N) HSQC NMR spectrum 
of cell lysate supernatant containing EnvZ ATP-binding domain 
without purification (Fig. ISA) and the HSQC NMR spectrum of 
purified EnvZ ATP-binding domain (Fig. 15B) . 

DETAILED DESCRIPTION OF THE INVENTION 

[0029] Using Escherichia coli as a model system, the 
laboratory of the present inventors have found that there are a 
number of cold-shock proteins that are required for growth only 
at low temperature . Among these cold-shock proteins , CspA is the 
major cold-shock protein, and is induced upon cold shock at a 
level of more than 10 6 molecules/cell (10~ 4 M) . The initiation of 
translation of the cspA mRNA is shown to be extraordinarily 
efficient at low temperatures and the cspA mRNA is found to 
possess the ability to suppress all other cellular mRNA 
translation. For example, when a nonsense codon is added 
immediately after the initiation codon of cspA, the mRNA from 
the truncated cspA gene in a pUC vector traps all the cellular 
ribosomes completely blocking other cellular protein synthesis 
upon cold shock and thereby completely inhibiting cell growth at 
low temperature. This phenomenon is termed as the "LACE ef f ect" . 
Therefore, using the remarkable translational advantage of the 
cspA mRNA together with its cold-shock-inducible mRNA 
stabilization and its highly efficient promoter, the present 
inventors designed a cold-shock inducible vector-host system in 

/ 
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which all cellular ribosomes are dedicated only for the 
production of a cloned gene product at low temperature, thereby 
providing for extremely high expression of a gene 
product /protein of interest upon cold shock. 

[0030] While cold-inducible vectors containing the cspA 
promoter have been shown to be useful for expression of 
aggregation-prone proteins such as preS2-S 1 -p-galactosidose and 
TolAI-p-lactamase (Mujacic et al . , 1999; Vasina et al., 1996 and 
1997) , other elements are most critical for efficient 
translation at a low cold shock inducible temperature. 

[0031] The present invention thus provides a DNA molecule 
capable of expressing a heterologous polypeptide under cold 
shock conditions, where the DNA molecule is preferably a self- 
replicating expression vector. The DNA molecule according to 
the present invention contains a nucleotide sequence encoding a 
heterologous polypeptide/protein of interest and a cold shock 
inducible promoter and 5 ' -untranslated region (5'~UTR) operably 
linked thereto to control and direct the expression and 
production of the heterologous polypeptide of interest in a host 
cell under conditions that elicit a cold shock response. In 
order to enhance translation efficiency, either the nucleotide 
sequence encoding the heterologous polypeptide has an AT rich 
sequence (naturally-occurring or created by, i.e., site-directed 
mutagenesis) at about the 4 th to 7 th codons downstream from the 
initiation codon or a translation enhancing element is inserted 
between polypeptide and the cold shock inducible promoter and 
5 ? UTR sequence. This translation enhancing element contains an 
initiation codon and an AT rich sequence at about the 4 th or 7 th 
codons downstream from the initiation codon to form a 
translational in-frame fusion with the nucleotide sequence 
encoding the heterologous polypeptide. 
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[0032] The present inventors have discovered that the AT rich 
sequence enhances translation at the translation initiation 
step. It is proposed that the AT rich sequence, which does not 
have secondary structure downstream of the initiation codon, may 
facilitate ribosome entry, and thereby enhance the efficiency of 
translation initiation and subsequent elongation reaction. . This 
in turn would increase the mRNA stability and the final 
polypeptide/protein production. Messenger RNA rich in A or A/T 
is unstructured and such an easily deformable mRNA is proposed 
to be efficiently accommodated by ribosomes, thus being 
favorable for translation initiation and/or early elongation. 

[0033] The term "heterologous polypeptide" is intended to 
mean any polypeptide that is not naturally expressed from a cold 
shock inducible promoter in a naturally occurring host cell. 
Thus, while the "heterologous polypeptide" can be a polypeptide 
from the same source as the cold shock inducible promoter and 
5'UTR, e.g., E. coll, as long as it is not naturally expressed 
from this promoter, preferably, this polypeptide is naturally 
expressed in a source different from the source of the cold 
shock inducible promoter and 5 1 -UTR, e.g., a non-E. coli source 
such as a human. 

[0034] A DNA molecule is said to be "capable of expressing" a 
polypeptide, such as a heterologous polypeptide, if it contains 
nucleotide sequences which contain transcriptional and 
translational regulatory information, and such sequences are 
"operably linked" to nucleotide sequences which encode the 
polypeptide. An operable linkage is a linkage in which the 
regulatory DNA sequences and the DNA sequence sought to be 
expressed are connected in such a way as to permit gene 
expression. The regulatory regions needed for gene expression 
in general include a promoter region as well as the DNA 
sequences which, when transcribed into RNA, will signal the 
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initiation of protein synthesis. Such regions will normally 
include those 5 ' -non-coding sequences involved with initiation 
of transcription and translation. 

[0035] A promoter region would be operably linked to a DNA 
sequence if the promoter were capable of effecting transcription 
of that DNA sequence. As used herein, a "promoter sequence" is 
the sequence of the promoter which is found on the DNA and is 
transcribed by the RNA polymerase. Thus, to express the 
heterologous polypeptide, transcriptional and translational 
signals recognized by the host cell are necessary. 

[0036] By the term "cold shock", it is meant that a host cell 
is subjected to a drop in incubation/growth temperature well 
below its normal physiological growth temperature, such as a 
temperature drop of at least about 13°C. 

[0037] The term "AT-rich" as applied to a nucleotide sequence 
is defined as having an AT base content of at least 70%. 

[0038] The cold shock inducible promoter and 5 ' UTR sequence 
is preferably any one of the cold shock inducible promoter and 
5' -UTR sequences from an E. coll cspA, cspB, cspG or cspl gene. 
The nucleotide sequences of the promoter and 5 ' -UTR regions as 
well as the coding and 3 • -UTR regions of the above E. coli cold 
shock proteins are presented as SEQ ID N0:1 (cspA) with SEQ ID 
NO: 2 as the encoded CspA amino acid sequence, SEQ ID NO: 3 (cspB) 
with SEQ ID NO: 4 as the encoded CspB amino acid sequence, SEQ ID 
NO: 5 (cspG) with SEQ ID NO: 6 as the encoded CspG amino acid 
sequence, and SEQ ID NO: 7 (cspl) with SEQ ID NO: 8 as the encoded 
Cspl amino acid sequence. The sequences are also available from 
the NCBI GenBank database under accession numbers AE000252 (CspB 
and Cspl), AE000433 (CspA), and AE000201 (CspG). The promoter 
and 5' -UTR region sequences of cspA, cspB, cspG, and cspl 
correspond to nucleotides 22-441 of SEQ ID NO:l, nucleotides 11- 
214 of SEQ ID NO: 3, nucleotides 12-213 of SEQ ID NO: 5, and 
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nucleotides 11-201 of SEQ ID NO: 7, respectively, and are shown 
as an alignment in Fig. 13. 

[0039] As will be appreciated by those of skill in the art, 
the cold shock inducible promoter and 5 1 -UTR for use in the DNA 
molecule of the present invention can be a fragment or a variant 
of the promoter and 5 1 -UTR sequences of any one of cspA, cspB, 
cspG or cspl as long as the ability to direct efficient 
expression and translation under cold shock inducible conditions 
is maintained. Fragments and variants in which certain 
sequences are deleted, substituted or inserted are intended to 
be encompassed by the present invention and guidance is provided 
in the Examples . 

[0040] One preferred embodiment of the present invention is 
where there is an AT rich sequence already present in the 4 th - 
7 th codon of the nucleotide sequence encoding the heterologous 
polypeptide to serve as a translation enhancing element. This 
AT rich sequence can be either present in the naturally 
occurring nucleotide sequence encoding the heterologous 
polypeptide or can be introduced into the coding sequence by, 
i.e., site-directed mutagenesis. For instance, the AT content 
of the region from the codon 4 to codon 7 can be increased by 
replacing the third base of each codon to A or T if it is G or 
C. This can usually be done without altering the amino acid 
sequence of this region because of the degeneracy of codon 
usage . 

[0041] Another preferred embodiment is where the cold shock 
inducible promoter and 5 1 -UTR sequence are preferably linked to 
the nucleotide sequence encoding the heterologous polypeptide by 
a translation enhancing element consisting of a nucleotide 
sequence of seven codons, where the first codon is an initiation 
codon and the 4 th - 7 th codons form the AT rich sequence. An 
embodiment of such a translation enhancing element is the 
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nucleotide sequence of nucleotides 1-18 of SEQ ID NO: 9 as would 
be generated from cloning into the Ndel site of the pColdll 
vector described in Example 3. 

[0042] Preferably, the DNA molecule according to the present 
invention also includes a 3' UTR sequence, such as a 3 1 UTR from 
any one of cspA, cspB, cspG, or cspl . A particularly preferred 
embodiment of a 3 T UTR is the cspA 3 1 -UTR which includes 
nucleotides 442-539 of SEQ ID NO:l or a fragment thereof. 

[0043] In another embodiment, the DNA molecule of the present 
invention may optionally contain a lad operator sequence, such 
as the nucleotide sequence of SEQ ID NO: 10, immediately 
downstream from the transcription initiation site of the cold 
shock inducible promoter. Such a lad operator may be used to 
circumvent problems if the heterologous polypeptide, even at a 
leaky low level of expression under non-induced conditions, is 
toxic to the growth of the host cell at its normal physiological 
growth temperature. In this way, the heterologous protein can 
only be produced under cold shock and in the presence of IPTG. 

[0044] The present invention also provides a vector which 
contains the DNA molecule according to the present invention. 
Preferably, the vector is self -replicating and directs the 
expression of the heterologous polypeptide in a host cell under 
conditions that elicit a cold shock response. Preferred 
embodiments of the vector of the present invention are the 
pColdl, pColdll, and pColdlll vectors described in Example 3. 
These three vectors all have the same pUC19 backbone (cleaved at 
Hindlll and EcoRI and then filled in with Klenow) but the insert 
sequences are different. Figs. 12A-12C show the insert 
sequences for pColdl, II, and III/ respectively. 

[0045] Another aspect of the present invention provides for a 
prokaryotic host cell transformed with the DNA molecule or 
vector according to the present invention. Preferably, the host 
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cell is Escherichia coll. As discussed in Example 4 herein, it 
is preferred that the host cell be a polynucleotide 
phosphorylase (pnpase) mutant lacking polynucleotide 
phosphorylase activity (pnp~) or a rbfA mutant lacking the 15 
kDa RbfA protein that associates with free 30s ribosomal subunit 

but not the 70s ribosomal subunit (rbfA") - Another preferred 

i 

embodiment of the host cell of the present invention is a host 
cell that is both pnp~ and rbfA". Further embodiments include a 
host cell that is a csdA RNA helicase mutant lacking CsdA RNA 
helicase activity (csdA") and a host cell that is co-transformed 
with a vector which overexpresses CsdA RNA helicase in the host 
cell- These csdA" or CsdA overexpressing host cells are 
preferably also pnp" and/or 
rbfA" . 

[0046] A further aspect of the present invention is directed' 
to a method for producing a heterologous polypeptide , including 
but not limited to, a medically important protein such as human 
proteins useful as therapeutics or as targets, proteins 
susceptible to heat denaturation, labeled proteins and proteins 
for use in NMR studies. The method involves culturing the host 
cell of the present invention containing the DNA molecule or 
vector of the present invention in a nutrient medium at the 
normal physiological growth temperature of the host cell and 
then subjecting the cultured host cell to cold shock by lowering 
the incubation temperature to a temperature at least 13°C below 
the normal physiological growth temperature of the host cell to 
induce production of the heterologous polypeptide. The cold 
shocked host cell is kept at the lowered cold shock temperature 
to produce the heterologous polypeptide during the production 
phase. The produced heterologous polypeptide is subsequently 
recovered. In another embodiment, the incubation temperature is 
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lowered to at least 20°C below the normal physiological growth 
temperature of the host cell. 

[0047] The advantages of the method of present invention are 
as follows: 

1. Since protein induction is carried out by 
temperature downshift, no chemical inducers such as 1PTG are 
required . 

2. Upon cold shock, cells are converted into a 
protein-synthesizing machinery dedicated to a single protein of 
interest. The synthesis of all the other proteins is virtually 
blocked in the cells, as all the cellular ribosomes are 
entrapped with the mRNA for the protein cloned in the vector. 

3. Therefore, by exchanging the medium after cold 
shock (for example from a 12 C-glucose/ 14 NH 4 Cl-base medium to a 
13 C-glucose/ 15 NH 4 Cl-base medium) , only the protein of interest 
can be labeled with isotopes, eliminating the purification 
process for the 13 C, 15 N-isotope labeled sample. 

4. If a protein of a molecular weight 20 kDa is 
produced (at a level of 60% of the total cellular protein and 
note that only this protein is newly synthesized upon cold 
shock) , the total yield of the protein per liter is 
approximately 120 mg . Provided that 80% of this protein is 
produced in a soluble form, with a cell suspension in 4 ml 
buffer or a cell lysate prepared from the suspension (sonicated 
and subsequently centrifuged to remove cell debris and the 
membrane fraction), one can obtain 25 mg/ml or 1.25 mM solution 
of the protein of interest, which can be directly used for NMR 
spectroscopy. 

5. As NMR spectroscopy can be done with less than 1 
ml, the culture size now can be reduced to 250 ml, which cuts 
the usage of expensive 13 C-glucose and 15 N-NH 4 C1 [or ( 15 NH 4 ) 2 S0 4 ] to 
a quarter. Furthermore, as the cells are unable to grow upon 
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cold shock under the proposed conditions, the overall carbon 
utilization should be much less than the growing cells. 
Therefore, the concentration of glucose in the medium could be 
less than the concentration usually used for 13 C-glucose (0.2%) . 
This further saves the consumption of the expensive 13 C-glucose. 

6. Some proteins are highly susceptible to heat 
denaturation . In particular, expression of proteins in E. coll 
from those organisms usually living at lower temperature 
environments (such as Caenorhabditis elegans) is problematic to 
express at higher temperature. The proposed cold-shock-vector- 
host system circumvents this problem. 

7. Although expression of some proteins may be higher 
with the T7 vector system, other protein expression may be 
achieved only with the proposed cold-shock-vector-system not 
only because of protein folding problems, but also because of 
transcriptional and translational regulation. This may become 
more and more evident as an extremely large number of genes from 
humans to bacteria become available. 

8 . As more and more medically important human proteins 
are identified, it is important to express these proteins in a 
soluble form and in a large amount. The cold-shock vector system 
according to the present invention serves as an alternative or 
complementary system for the conventional T7 expression system. 

As will be appreciated by those of skill in the art, 
the method for producing a heterologous polypeptide according to 
the present invention can be used to incorporate a detectably 

13 15 

labeled compound, such as a compound labeled with C or N, 
which are isotopes normally found in not more than negligible 
amounts in the host cell. When a detectably labeled compound is 
desired to be incorporated into the heterologous polypeptide 
produced by the present method, - such as for NMR spectroscopy 
analysis, the nutrient medium is exchanged/replaced with a 
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medium containing a compound for detectably labeling the 
heterologous polypeptide. The detectably labeled compound is 
preferably 15 NH 4 C1, ( 15 NH 4 ) 2 S0 4 , 13 C-glucose / a single amino acid 
residue labeled with 15 N, a single amino acid residue labeled 
with 13 C, or a single amino acid residue labeled with both 15 N 
and 13 C. Whereas 15 NH 4 C1, ( 15 NH 4 ) 2 S0 4 , and 13 C-glucose provide non- 
selective labeling throughout the produced heterologous 
polypeptide, a single amino acid residue, i.e., Glu, labeled 
with 15 N and/or 13 C provides selective labeling at residue 
positions in the heterologous polypeptide corresponding to the 
particular single amino acid residue. 

[0048] Example 5 presented hereinbelow demonstrates the 
production of a heterologous polypeptide (ATP-binding domain of 
EnvZ) suitable for NMR spectroscopy of cell lysate supernatant 
without protein purification. The E . coll B strain BL21 used as 
the host cell in Example 5 is not a mutant lacking any of the 
major cold shock proteins, e.g., CspA. However, in the event 
that less background noise in NMR spectroscopy, resulting from 
the presence of labeled major cold shock proteins, is desired, a 
mutant host cell lacking one or more of the major cold shock 
proteins, preferably lacking CspA or a combination of CspA and 
another cold shock protein such as for example in the quadruple 
mutant AcspA AcspBAcspGAcspE, can be used instead. 

[0049] The present invention further provides a method for 
obtaining a NMR spectrum of a heterologous polypeptide. This 
method involves culturing in a nutrient medium a prokaryotic 
host cell transformed with a nucleic acid comprising a 
nucleotide sequence encoding a heterologous polypeptide and 
capable of expressing and producing the heterologous polypeptide 
in response to cold shock. This is followed by subjecting the 
cultured host cell to cold shock by lowering the incubation 
temperature to a temperature below the normal physiological 
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growth temperature of the host cell to induce the production of 
the heterologous polypeptide. After cold shock, the nutrient 
medium is exchanged/replaced with a medium containing a compound 
for detectably labeling the heterologous polypeptide. The cold 
shocked host cell is incubated at the lowered incubation 
temperature to produce the heterologous polypeptide detectably 
labeled with the compound, and then isolated. NMR spectroscopy 
can be conducted on either the isolated cold shocked host cell 
(whole cell NMR spectroscopy) or on the supernatant from cell 
lysate of the isolated cold shocked host cell to obtain a NMR 
spectrum of the heterologous polypeptide. 

[0050] Having now generally described the invention, the same 
will be more readily understood through reference to the 
following examples which are provided by way of illustration and 
are not intended to be limiting of the present invention. 

EXAMPLE 1 

Construction of prototype vectors 

[0051] A pUC19 vector (New England BioLabs, Beverly, MA) 
without the Ndel site was created by digesting the original 
plasmid with Ndel followed by Klenow filling and ligation was 
created. The PCR product including the T7 / cspA promoter, the 5'- 
UTR region of CspA, and its coding region was cloned in the 
pUC19ANdeI plasmid in the EcoRX and Jfindlll sites. These two 
restriction sites were then deleted by Klenow filling. By site- 
directed mutagenesis, Sail and JBamHI sites were created 
immediately after the cspA stop codon. A second DNA fragment 
containing a six His-tag, factor-Xa site for protease cleavage, 
and multiple cloning sites including Ndel, Hindlll, and Kpnl and 
the 3' region of cspA was synthesized and was cloned in this 
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vector. The map of the prototype plasmid vector is shown in 
Fig. 1. 

Cloning and expression of the proteins from Caenorhabdl tls 
eiegans 

[0052] Some proteins are highly susceptible to heat 
denaturation . In particular, expression of proteins in £. coli 
from those organisms usually living at lower temperature 
environments (such as C. elegans) is problematic to express at 
higher temperature. Five C. elegans proteins were chosen and 
were provided by Dr. G. Montelione (designated as WR49, WR35, 
WR26, WR27, and WR53, Genebank accession numbers: AV185320, 
AV183398, C50788, D71923 and C48848, respectively) that were 
cloned in pET vector and could not be expressed well at 37 °C 

(Fig. 2A) . The inserts containing genes encoding these proteins 
were sub-cloned in the cold shock vector described in Fig. 1. 
The cells were grown at 37 °C to the OD600 nm of 0.5 and tjhen 
they were transferred to 15°C and were induced with 1 mM IPTG 

(isopropyl b-D thiogalactopyranoside in order to induce T7 
promoter) and samples were removed at 0, 12 and 24 h and the 
protein expression was analyzed by SDS-PAGE. Fig. 2A shows the 
expression level of all these proteins at 37 °C using pET vectors 
after 3 h induction with 1 mM IPTG and Fig. 2B shows their 
expression at 15°C using the respective cold-shock vectors. The 
proteins WR49 and WR35 could not be expressed at all at 37°C 
using the pET expression vectors (lanes 2 and 4 in Fig. 2A, 
respectively) . The protein WR26 was very poorly expressed (lane 
6, Fig. 2A) . All of these three proteins were greatly expressed 
at 15 °C using cold shock vectors. The expression of proteins 
WR27 and WR53 was also significantly higher using the cold shock 
vectors. Cold-shock vectors were also successfully used for 
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expression of human proteins that otherwise could not be 
expressed well with conventional vectors. 

Time profile of production of C. elegans proteins. 

[0053] Next, the time profile of production of proteins using 
the cold shock vectors was examined. WR49, WR35 and WR26 
proteins that either could not be expressed at all or expressed 
very poorly at 37 °C using the conventional T7 vectors were 
chosen (Fig. 2A) . The cells containing respective plasmids were 
grown at 37 °C to the OD600 nm of 0.5 and then they were 
transferred to 15 °C and were induced with 1 mM IPTG and samples 
were removed at 24 h intervals from 0-120 h and the protein - 
expression was analyzed by SDS-PAGE . Fig. 3 shows the 
expression level of these proteins. Maximum expression of each 
protein was seen within 3-4 days and was maintained stably 
afterwards. Secondly, the expression of all other cellular 
proteins was significantly inhibited, resulting in production of 
protein of interest that is around 85% pure. 

Expression of proteins using cold shock vectors containing only 
the cspA promoter 

[0054] A specific aim of the present inventors is to 
synthesize a protein of interest at a translation efficiency of 
90-95% in a cell, with a protein yield of higher than 60-70% of 
the total cellular proteins, where cell lysate or direct cell 
suspensions may be directly used for NMR spectroscopy. The 
fusion protein poses a problem in this aspect as separation and 
purification of the protein of interest from CspA is required 
before NMR spectroscopy. Hence, the laboratory of the present 
inventors have also constructed the cold-shock vectors with cspA 
promoter without the cspA coding region. The vector also 
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contains a lac operator upstream of the cspA 5'UTR, thus the 
gene of interest can be induced by IPTG. The protein y~ 
interferon that is important from a medical perspective was 
chosen. The gene encoding yinterf eron was cloned in this 
vector and its production at 15°C was checked as for the C. 
elegans proteins described above. Fig. 4A shows expression of 
y-interferon protein at 37 °C and 15 °C. Consistent with the fact 
that the cspA 5'UTR is responsible for the instability of cspA 
mRNA and therefore its lack of expression at 37°C, no y 
interferon expression was observed at this temperature (lane 6) . 
On the other hand, it was well expressed at 15°C (lanes 2-5) . 
However, other cellular proteins are likely to be present as a 
result of carry over from the growth at 37 °C. In order to check 
this possibility the cells were pulse-labeled at 15°C. The 
cells were grown in the labeling medium to the exponential phase 
and shifted to 15 °C. One ml portions of the culture were 
labeled with 5 ml of 35 S-methionine as described previously (Xia 
et al., 2001). Cells were chased by the addition of non- 
radioactive methionine. Cells were washed with 20 mM sodium 
phosphate buffer (pH 7.0) and were resuspended in 100 ml SDS 
sample buffer. Samples were analyzed by SDS-PAGE. y-Interferon 
was now found to comprise more than 90 % of the cellular labeled 
proteins (Fig. 4B, lane 2) . This suggests that y-interferon can 
be almost exclusively produced at 15°C with the cold-shock 
vector. This method thus will be extremely useful in labeling 
the proteins specifically with isotopes such as 13 C and 15 N 
allowing a whole cell NMR spectroscopy without protein 
purification . 
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Identification of mRNA downstream cis-elements favoring the 
initiation of protein synthesis 

[0055] In order to further improve the present cold-shock 
vector system, the laboratory of the present inventors explored 
the mRNA cis-elements downstream of the initiation codon 
favoring the initiation of protein synthesis by constructing a 
molecular repertoire through a lacZ expression system. The 
sequence used was as follows: 5'- GGATCTAGAGGGTATTAATAATGACTGG 
TGCA NNNNNNNNNNNN - 1 a cZ- termination signal-3' (SEQ ID N0:11). The 
initiation codon and the random insertion sequence are 
underlined. This sequence was cloned in IPTG-inducible pINIII 
vector (Inouye, 1983) and the plasmid was transformed in CL83 
cells. 130 clones were randomly selected and sequenced. 

[0056] Out of 130 clones, 57 were chosen for measurement of 
p-galactosidase activity according to the method of Miller 
(Miller, 1992) . Most interestingly, the range of (J- 
galactosidase activity is from 1,500 to 43,000 units, indicating 
that the coding region for codon 4 to codon 7 plays an important 
role in gene expression. Although there is not any one 
particular consensus sequence being responsible for higher 
enzyme activities (data not shown), the ratio of (A+T) / (A+T+G+C) 
was nearly 70% in the clones showing high (5-galactosidase 
activities (Fig. 5) . It was also apparent that in the clones 
showing higher lacZ expression, the most frequently used codons 
were the ones encoding amino acids such as Tyr, Thr, Leu and 
He. Four clones with high (3-galactosidase activity were chosen 
and frame-shift were introduced mutations in each of them. As 
seen from Table 1, none of the mutations had any significant 
effect on the lacZ expression, except of the two mutations that 
resulted in introduction of stop codons (underlined in the 
Table) . 
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TABLE 1 

The Effects of Frame-Shift Mutations on p-Galactosidase Activity 



Insertion sequence: 
Clone 1: 

Original sequence 
Frameshift mutation 
Frameshift mutation 



TGCANNNNNNNNNNNNCCAA (SEQ ID NO: 1 3) p-galactosidase (U) 



ATT TAT ATA TAT 
CATTTATAT ATA 
TTTATATAT ATA 



(SEQ ID NO: 14) 
(SEQ ID NO: 15) 
(SEQ ID NO: 16) 



42440 
41823 
42220 



Clone 2: 

Original sequence 
Frameshift mutation 
Frameshift mutation 

Clone 3: 

Original sequence 
Frameshift mutation 
Frameshift mutation 

Clone 4: 

Original sequence 
Frameshift mutation 
Frameshift mutation 



ACT TTT ACA AAG 
CAC TTT TAC AAA 
CTT TTA CAA AGA 



ACA CAT GAA CAC 
CAC ACATGA ACA 
CAC ATG AAC ACA 



CAT AGT TTT CAA 
CCA TAG TTT TCA 
ATA GTT TTC AAA 



(SEQ ID NO: 17) 
(SEQ ID NO: 18) 
(SEQ ID NO: 19) 



(SEQ ID NO:20) 
(SEQIDNO:21) 
(SEQ ID NO:22) 



(SEQ ID NO:23) 
(SEQIDNO:24) 
(SEQ ID NO:25) 



42150 
41823 
43824 



37228 
519 
35495 



35804 

759 
35100 



[0057] Thus, specific codon usage did not have any obvious 
effect on the lacZ expression . This supports the notion that 
the high AT content is important for higher expression. 
Therefore the translation enhancing effect of the downstream box 
appears to be due to its higher AT (AU) content rather than its 
complementary sequence to the 16S RNA. The A/T-rich sequences 
downstream of the start codon may enhance the protein 
translation in E . coli, because the less stable secondary 
structure of A/T-rich sequences may result in the increased 
number of ribosomes initiating. 



The increased p-galactosidase activity is due to increased 
protein expression 

[0058] Fifty-seven of the above clones were further analyzed 
with respect to their protein profile and p-galactosidase 
activities. The c-lls containing respective plasmids were grown 
at 37 °C to the OD600 nm of 0.5 and were induced with 1 mM IPTG. 
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The protein expression was analyzed by SDS-PAGE and the protein- 
band densities were measured by densitometric analysis. The p- 
galactosidase activities were carried out as described before. 
As seen from Fig. 6, the p-galactosidase activities well 
correspond to the respective protein levels. The present 
inventors conclude that the higher (3-galactosidase activity is 
due to higher specific activity of the protein itself. 

Enhanced translation and not transcription is responsible for 
higher p-galactosidase activity 

[0059] The clones obtained could be divided into four groups 
based on their p-galactosidase activities as clones with enzyme 
units in the range of (i) 43,000-35,000, (ii) 35,000-25,000, 
(iii) 25,000-10,000 and (iv) below 10,000. 16 clones 
representing all these four groups (4 from each) were selected 
and lacZ mRNA levels were checked in these clones using 
oligonucleotides corresponding to la.cZ by primer extension 
(Etchegaray et al . , 1999). Fig. 7 shows p-galactosidase 
activities, protein levels and lacZ mRNA levels in these clones. 
Similar to what was demonstrated in Fig. 6, the p-galactosidase 
activities corresponded to the protein levels. However, the 
clones showing high and low level of p-galactosidase activities 
showed similar levels of lacZ mRNA indicating that the higher p- 
galactosidase activities are not due to the transcription 
upregulation . Thus, the present inventors conclude that it is 
result of translational enhancement. 

A/T (A/U) -rich element downstream of the initiation codon 
enhances translation initiation 

[0060] Next, the laboratory of the present inventors explored 
if translation initiation or elongation was responsible for this 
effect. Two clones showing the highest and lowest activity were 
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chosen. Their sequences, SEQ ID NO: 14 and SEQ ID NO: 26, are 
shown in Fig. 8. The clone with (3-galactosidase activity of 
42,4 40 units had an A/T-rich sequence, ATTTATATATAT (SEQ ID 
NO:14), downstream of the initiating codon, while the clone with 
activity of 4410 units had G/C-rich corresponding sequence, 
CGGTCTCTCCGC (SEQ ID NO:26). The ribosome profiles of these two 
clones were examined. The ribosomes were prepared and 
fractionated as described previously (Etchegaray et al . , 1996 
and Xia et al . , 2001). The primer extension was carried out to 
examine distribution of the lacZ mRNA in the polysome and the 
70S, 50S or 30S ribosomes. The ompA mRNA levels were also 
examined as a negative control. As seen from Fig. 8, the clone 
with higher activity showed higher ratio of lacZ to ompA mRNA in 
the polysome and the 7 0S fraction of the sucrose gradient as 
compared to the clone with lower (3-galactosidase activity. This 
suggests that higher p-galactosidase expression is due to 
enhanced translation initiation, which is mediated by the A/T- 
rich sequence downstream to the initiation codon. 



EXAMPLE 2 

Identification of translational els elements in tlie cspA mRNA 
responsible for the LACE effect. 

[0061] The cspA mRNA contains a 159-base long 5'UTR 
(untranslated region) , which is considered to play the most 
essential role in the LACE effect. The laboratory of the 
present inventors have demonstrated that the LACE effect is 
caused by trapping all cellular functional ribosomes by the cspA 
mRNA, thereby inhibiting all the other cellular mRNAs to form 
the initiation complexes for translation. This ability of the 
cspA mRNA to prevent all the other mRNAs from interacting with 
30S ribosomes is speculated to be due to its highly efficient 
ability to form the initiation complex with ribosomes. 
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[0062] In prokaryotes, there are at least two important 
elements for translation initiation: one is the ribosome binding 
sequence, the so-called Shine-Dalgarano sequence (SD) , and the 
other is the initiation codon AUG . In addition, a number of 
other elements for enhancing translation initiation have been 
reported (Gold et al., 1988). For example, it has been reported 
that there is an translation-enhancing element upstream of the 
SD sequence, termed "Epsilon" (Olins et al . , 1989). It has been 
debated that there is also an element downstream of the 
initiation codon, called "downstream box' (Etchegaray et al . , 
1999a and 1999b; O'Connor et al . 1999) . It is also important to 
note that the secondary structures in the 5'UTR are involved in 
the regulation of translation initiation. In particular the 
formation of a secondary structure at or near the SD sequence is 
often inhibitory for the formation of the initiation complex 
(Yamanaka et al . , 1999). 

[0063] The stability of bacterial mRNAs in the cells is 
another important factor in the regulation of expression of 
genes. Particularly, mRNAs such as the cspA mRNA having an 
unusually long 5'-UTR are in general more prone to endonuclease 
attack. Indeed, the cspA mRNA is highly unstable at higher 
temperature. At 37°C, its half-life is only less than 12 sec and 
this instability is shown to be attributed to the AU-rich 
sequence just upstream of the SD sequence. Base substitution 
mutations at this region has been shown to dramatically 
stabilize the cspA mRNA so that CspA production becomes 
constitutive at 37°C (Fang et al . , 1998). The elements in the 
cspA 5'-UTR essential for translation initiation enhancement 
will be indentified. Through these analyses, it is possible to 
further improve the efficiency of translation initiation. The 
following approaches will be taken. 
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(i) Development of the system to evaluate the translation 
Initiation efficiency 

[0064] The LACE effect has been demonstrated to be caused by 
the highly effective translational initiation capacity of the 
cspA mRNA (Xia et al . , 2001). The LACE effect is defined (a) by 
the antibiotic effect of a clone on colony formation of the 
cells carrying the clone at 15°C, (b) by the effect of the clone 
on the [ 35 S] methionine incorporation into total cellular 
proteins at 15°C, which can be analyzed by SDS-PAGE and (c) by 
growth curve of the cells carrying a LACE-causing clone. 

[0065] In order to evaluate the strength of the LACE effect 
in individual clones, it is important for one to be able to 
quantitatively measure the LACE effect. For example, in case of 
pAOlS, pAlOS and pA30S constructed earlier by inserting a 
nonsense codon at 2nd, 11th, and 31st codons, respectively, in 
the cspA gene, the laboratory of the present inventors showed 
complete inhibition of colony formation at 15 °C, and almost 
complete inhibitory effects on [ 35 S] methionine incorporation 
into cellular proteins at 15°C after cold shock. However, these 
three clones may be able to be evaluated by their effectiveness 
of the LACE effect, if colony formation and protein synthesis 
are measured at higher temperatures. These clones carrying the 
stringent LACE effect may still retain their effective LACE 
effect at 20°C or even at 25°C, while those carrying weaker LACE 
effect may start to form colonies at 20 °C and protein synthesis 
may be observed at this temperature. Therefore, a simple plating 
method should be able to evaluate the effectiveness of 
individual LACE-causing clones. The laboratory of the present 
inventors will test this method with the LACE-causing clones 
available in their laboratory, pAOlS, pAlOS, and pA30S (Xia et 
al., 2001). Cells harboring these plasmids will be plated on 
four LB agar plates and incubated separately at 37, 25, 20 and 
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15°C. The strength of the LACE effect will be classified by 
examining inhibition of colony formation at higher temperatures 
and also by the colony sizes at a given temperature. 

[0066] The LACE effects at different temperatures will then 
be confirmed by [ 35 S] methionine incorporation and growth 
curves. It is likely that one can obtain a similar result for 
evaluation of the LACE effectiveness for individual clones 
constructed by any one of these methods. However, as plating is 
the simplest method among the three, this method will be used if 
it is proved to be as effective as the others. 

(ii) Analysis of the 5' -UTR of the cspA mRNA 

[0067] The secondary structures of a given RNA can be drawn 
in a number of different ways, and in particular, the exact 
secondary structures assumed by one mRNA in a cell are hard to 
be determined. Nevertheless, one can use a computer program to 
predict a likely, stable secondary structure of an mRNA. Fig. 9 
shows the most stable secondary structure of the 5' -UTR of the 
cspA mRNA by computer analysis using the Zuker program (version 
3.1 available at www.bioinfo.math.rpi.edu) for RNA folding. It 
shows that there are 6 secondary structures from I to VI and 
their stabilities are calculated to be -7.0, 

-14.8, -2.1, -8.3, 2.7, 1.5 Kcal, respectively. The secondary 
structure marked VII is the one formed at the 3' end and is 
considered to be p-independent transcription terminator. 

[0068] Among the six secondary structures, structure I is 
known to be the Cold Box which is important for the stability of 
the cspA mRNA; hence, it cannot be eliminated. In the present 
approach to study the role of other structures from II and V, 
they will be deleted one by one to examine the effect of the 
deletion on the LACE effect using pA30S that has a nonsense 
codon at the 31st position in the coding sequence of cspA (Xia 
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et al., 2001). If any effects by the deletion are found, in 
particular in the case of structure II , this large structure 
will be further dissected into small fragments. The stem region 
can be further divided into three parts, Ila, lib and lie as 
shown in Fig. 9. The following deletions will be constructed; 
Alia, Allb, AIIc, AUbAIIc, AIIaAIIb, and AIIaAIIc. These 
deletions in pA30S will be examined for their LACE effect using 
the plating method developed in section (i) . 

The role of structure IV 

[0069] This region was earlier shown play an important role 
in translation efficiency (Yamanaka et al., 1999). The 
laboratory of the present inventors reported that a 13-base 
sequence named the upstream box (bases 123 to 135; UB sequence) 
which is highly conserved in cspA, cspB, cspG and cspl and is 
located 11 bases upstream of the SD sequence might be another 
cis element involved in translation efficiency of the cspA mRNA 
in addition to the SD sequence (Xia et al . , 2001 and Yamanaka et 
al., 1999). The exact requirement of this structure will be 
examined by further dissecting the structure by deletion of 
smaller regions within the structure and by base substitutions. 

The role of structure V 

[0070] It is also interesting to note that there is a cls- 
element upstream of the SD sequence termed "Epsilon" that has 
been proposed to enhance translation of a gene in the 
Bacteriophage T7 (Olins et al. 1989) . The Epsilon sequence, 
UUAACUUUA (SEQ ID NO:27), was originally found in the 
bacteriophage T7 gene 10 leader region. It has been reported 
that when the complementarity to 16S rRNA was extended, the 
epsilon was converted from an enhancer to an independent 
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initiator of translation. It has been shown that Epsilon 
displayed maximal activity as a translational initiator at its 
natural 9-nucleotide-long complementarity to 16S rRNA and at a 
16-nucleotide-long distance to the initiation codon. Under these 
conditions its efficiency was comparable with that of the 
consensus Shine-Dalgarno sequence (Golshani et al . , 2000). The 
sequence of structure V is similar to Epsilon (Figs. 9, 10A and 
10B) . 

[0071] First, the 10-base sequence, UUAAUUAUUA (nucleotides 
16-25 of SEQ ID NO:28), immediately upstream of the SD sequence 
(AAGG) will be analyzed for its role in the LACE effect. This 
Epsilon-like sequence consists of only A and U residues. Various 
deletion mutations (mutants 1, 2, 9, 10 and 11 in Fig. 10A) as 
well as substitution mutations will be constructed (mutants 3 to 
8) . The effects of these mutations on the LACE effect will 
reveal if there is a unique sequence requirement in this element 
as proposed for the Epsilon sequence. If the laboratory of the 
present inventors is able to demonstrate that the 10-base 
sequence has a significant role in enhancing translation (or 
LACE effect) , the sequence will be examined for whether it can 
by itself be capable of initiating translation even without the 
SD sequence. For this purpose, the cspA SD sequence (AAGG) will 
be eliminated by mutating it to AACC (mutation 12 in Fig. 10B) . 
If mutant 12 can exert the LACE effect, the 10-base sequence can 
be deleted to confirm that this region is indeed required for 
ribosome binding (mutant 13) . Subsequently, the requirement of 
the distance between the Epsilon-like sequence and the 
initiation codon will be determined by deletion mutations as 
shown in mutants 14, 15 and 16. 

[0072] It is possible , that the so-called Epsilon sequence 
just functions because of its AU-richness without any sequence 
specificities. For this reason, the laboratory of the present 
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inventors will also scramble or elongate the AU sequence from 
the present 10-bases to AU sequences 13-, 14- and 16- bases in 
length to examine their effect on the LACE effect. If the AU- 
rich sequence is indeed able to enhance translation, polysome 
patterns will be compared between the wild-type cspA RNA, mutant 
1 (Fig. 10A) and mutant 12 (Fig. 10B) to confirm that the LACE 
effect is resulting from trapping cellular ribosomes . 

[0073] Further confirmation of the role of structure V in 
translation enhancement will be carried out with pAlacZ that has 
been already constructed containing the cspA promoter, 5' UTR 
and the coding region of cspA up to its 13th codon fused to lacZ 
gene. Deletions and substitution mutations similar to the ones 
described above will also be created using this pAlacZ vector. 
The AcspA cells will be used as host for transformation to avoid 
the autoregulating effect of cspA produced from the chromosomal 
gene, and the level of cspA expressed from these plasmids will 
be examined by (3-galactosidase assays. Through these 
experiments, the exact role of structure V in translation 
initiation will be determined. 

EXAMPLE 3 

Construction of the cold- shock vectors 

[0074] Cold-shock vectors in three forms will be constructed. 
In the first type, pColdl (Fig. 11A) , is a vector in which the 
coding region of a gene is cloned as a fusion protein with the 
entire coding region of CspA. SEQ ID NO: 45 as shown in Fig. 12A 
is present as an insert within the blunted Hindlll-EcoRI sites 
of pUC19 (New England BioLabs, Beverly, MA) . The second vector, 
pColdll (Fig. 11B) , is a vector in which the coding region of a 
gene of interest is cloned after the 7th codon, where codons 1 
to 7 are MetAsnHisLysValHisMet (SEQ ID NO: 50) at a unique Ndel 
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site created at the 7th Met codon. This extra sequence is 
encoded by the cis-translation enhancing element 
(ATGAATCACAAAGTGCATATG; SEQ ID NO: 9, where the Ndel site is 
underlined) with which a cloned gene expression was shown to be 
8-fold higher than a clone without this sequence (Etchegaray et 
al., 1999). SEQ ID NO:46 as shown in Fig. 12B is present as an 
insert within the blunted Hindlll - EcoRI sites of pUC19. The 
third type, pColdlll (Fig. 11C) , is a vector in which the coding 
region is cloned directly at the initiation codon using an Ndel 
site. In this vector, SEQ ID NO: 47 as shown in Fig. 12C is 
present as an insert within the blunted Hindlll - EcoRI sites of 
pUC19. 

[0075] The three vectors, pColdl, pColdll, and pColdlll are 
identical vectors in that they all have a pUC19 background 
having the cspA promoter, the cspA 5'-UTR region and the cspA 
3' end transcription terminator site. The difference being that 
in addition to this, pColdl has the coding region of cspA, while 
pColdll has the AT rich DB sequence. By including the coding 
region of cspA in pColdl, advantage is taken of its built-in 
high efficiency translation initiation property. It is also 
possible that the folding of the CspA domain at the N-terminal 
end may help the folding of the cloned gene product, as shown in 
the case of MBP (maltose-binding protein) -fusion proteins. In 
addition, pColdl is designed to have a six-His tag followed by 
the factor Xa cleavage site upstream of the Ndel cloning site in 
such a way that the product produced from this vector can be 
purified by a Ni-NTA column and that the N-terminal fragment can 
be removed by factor X cleavage. 

[0076] It is important to note that when the pColdll vector 
is used, six amino acid residues, MetAsnHisLysValHis (residues 
1-6 of SEQ ID NO: 50) are added in front of the initiation codon 
of a given gene. Therefore, it has to be checked if this extra 
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N-terminal extension affects the structure and function of a 
cloned gene product for each case. Advantage can be taken of 
this extra sequence to develop antisera against this sequence, 
which may be used for identification of the product and for 
purification if necessary, 

[0077] All of these vectors are designed in such a way that a 
Ndel-BamHI fragment of a gene of interest can be cloned in the 
same reading frame and is mutually exchangeable between pCold 
and conventional pET vectors. In case a gene of interest is 
toxic for E. coll cell growth, where even a low level of leaky 
expression of the gene may cause a problem during growth at 
37°C. For this reason, pColdl, pColdll and pColdlll having the 
lac J operator sequence, AT T G T GAG C G G AT AA C AA T T GAT G T G (SEQ ID 
NO: 10), immediately downstream of the cspA transcription 
initiation site will be constructed so that the expression of a 
cloned gene can be induced at low temperature only in the 
presence of IPTG. These pCold vectors will be termed pColdlo, 
pColdllo and pColdlllo, respectively. 

[0078] In addition, an attempt will be made to determine the 
best possible SD sequence for higher cold-shock expression and 
the effect of the sequence downstream of the initiation codon 
(downstream box) as follows: 

(i) The SD sequence 

[007 9] The SD sequence found in the cspA mRNA (AAGG) is not 
the best SD sequence compared with a possible SD sequence which 
has the highest complementarity with the 3' end of 16S RNA. It 
may be improved by changing the cspA SD region to a longer 
consensus sequence, for example to AAGGAGGU by inserting AGG in 
the cspA mRNA. The SD sequence will be changed base by base by 
point mutations as shown in Figs. 10A and 10B. Finally, the best 
SD sequence will be determined for the vector. 
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(ii) The translation-enhancing effect by a sequence downstream 
of AUG in pColdlll 

[0080] The so-called Downstream Box (DB) effect indeed exerts 
its effect on translation initiation as judged from the 
experiments on the analysis of random sequences added at the 
regions from the 4th codon to the 7th codon (12 base sequences) . 
As shown in Example 1 (Fig. 5), the DB sequences that 
significantly enhance the LACE effect basically consist of AT- 
rich sequences (but not the complementary sequence to 16S rRNA 
as originally proposed by Sprengart and his coworkers (1996) . 
Interestingly, yinterferon which is expressed at a very high 
level, even if it expressed directly from its own initiating 
codon (Figs. 4A and 4B) , has nine A/T residues out of twelve 
residues in this region. When the pColdlll vector is used, one 
can increase the AT content of the region from codon 4 to codon 
7 replacing the third base of each codon to A or T it is G or C. 
This can be done without altering the amino acid sequence of 
this region because of degeneracy of the codon usage. Once a 
gene is cloned into pColdlll, whether or not the effect of the 
changed AT content in the first 7 codons is better for 
expression of the genes product of each clone will be examined. 

EXAMPLE 4 

Improvement of the host system 

[0081] In order to obtain the maximal yield of a protein to 
be expressed using the cold-shock vectors described above, the 
present inventors would like to improve the host system and to 
establish the growth conditions at low temperature. Important 
considerations for the host improvement are; (i) stabilization 
of the mRNA for the cloned gene, (ii) prolonging of the 
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acclimation phase to block the cold-shock adaptation and (iii) 
increased production of cold-shock ribosomal factor such as 
CsdA, which may enhance the translation initiation and protein 
synthesis . 

(i) Prolonged expression of cloned proteins at 15°C using pnp 
deletion cells as host cells 

[0082] Upon cold shock, E. coll cell growth transiently 
stops , and during this acclimation phase, specific-cold shock 
proteins are highly induced. At the end of the acclimation 
phase, their synthesis is reduced to new basal levels, while the 
non-cold-shock protein synthesis is resumed, resulting in cell 
growth reinitiation. Polynucleotide phosphorylase (PNPase) is a 
cold-shock-inducible exoribonuclease and a component of RNA 
degradosome involved in mRNA degradation. PNPase has been shown 
in the laboratory of the present inventors to be required to 
repress Csp production at the end of the acclimation phase. 
PNPase was found to be essential for selective degradation of 
csp mRNAs at 15°C (Yamanaka et al., 2001). In a pnp mutant, the 
induction of Csps upon cold shock was normally observed as in 
the wild-type strain; however, their production was no longer 
autoregulated and was maintained at high levels throughout cold- 
shock treatment. This resulted in cells' growth arrest and a 
dramatic reduction of colony forming ability below 25°C. As a 
result, upon cold shock, cells maintained a high level of Csps 
even after 24 h. In a poly (A) polymerase mutant and a CsdA RNA 
helicase mutant, Csp expression upon cold shock was 
significantly prolonged, indicating that PNPase in concert with 
poly (A) polymerase and CsdA RNA helicase plays a critical role 
in cold-shock adaptation. Therefore, the use of the pnp 
deletion strain as a host for our cold-shock vectors is likely 
to help to maintain the cloned gene expression for a much longer 
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time period. Such cells are already available in our laboratory. 
It is expected that in the absence of PNPase, cspA expression 
from the cold-shock vectors of the present invention will be 
prolonged, resulting in prolonged expression of the proteins of 
interest. The pnp deletion cells will be transformed with the 
cold-shock vectors containing the inserts corresponding to the 
genes encoding proteins of interest and the cells will be grown 
at 37°C to the OD600 nm of 0.5 and then transferred to 15°C. 
Samples will be removed at 12 h intervals from 0-120 h to 
analyze the protein production by SDS-PAGE. 

(ii) Prolonged expression of cloned proteins at 15 °C using r±>£fl. 
deletion cells as host cells 

[0083] The gene rbfA encodes a 15 kDa protein (RbfA) that 
associates with free 30S but not 70S ribosomal subunits (Dammel 
et al., 1995). The gene was originally isolated as a multicopy 
suppressor of a dominant cold-sensitive mutation located in the 
5' helix of 16S rRNA. The rbfA deletion mutant shows impaired 
growth at low temperature in a similar manner as in the pnp 
deletion cells (Dammel et al . , 1995). The laboratory of the 
present inventors showed that RbfA is a cold-shock protein and 
in the rbfA deletion strain in spite of severe growth inhibition 
at low temperatures, cold-shock proteins and ribosomal proteins 
were highly induced and their expression became constitutive in 
a manner similar to that with the pnp deletion cells (Dammel et 
al., 1995). The laboratory of the present inventors have 
recently shown that RbfA has dual function in that it is 
involved in 30S ribosomal subunit maturation and translation 
initiation at low temperature. Considering the fact that 
deletion of rbfA results in constitutive expression of cold- 
shock proteins such as CspA, these cells will be ideal for 
expression of proteins using the cold-shock vectors at 15°C. 
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[008 4] pnp and rbfA deletion strains have already been 
constructed and will double deletion mutants of these two genes 
will be created for expression of proteins using cold-shock 
vectors. The rbfA mutant strain, named BX41, was constructed by 
transducing a disrupted rbfA allele from strain CD28 . CD28, 
disclosed in Dammel and Noller (1995), was obtained from their 
laboratory and transduced to MC4100 using PI phage. The pnp 
mutant strain, disclosed in an article from Dr. Kushner's 
laboratory, Arraiano et al . (1988), was obtained from his 
laboratory . 

[0085] Phage Plvir-mediated transduction (Miller, 1992) of 
the pnp strain with the rbfA strain will be used to create the 
double deletion mutant. For instance, a PI phage is used to 
make rbfA mutant strain lysate, and then the rbfA f pnpase double 
mutant is made by PI transduction using rbfA lysate and pnpase 
(pnp) mutant strain. The rbfA PCR product of each colony after 
Pi transduction is checked to confirm that they are rbfA 
mutants. Competent cells of these mutant strains are made and 
then transformed with the plNrbfA plasmid into the competent 
cell, incubated at 15°C and checked for cold sensitivity to 
confirm the double mutation. 

(iix) The role of CsdA on the cloned gene expression 

[0086] Poly (A) polymerase of E . coll was reported to 
interact with RNA, RNase E and DEAD-box RNA helicases of RhlE, 
SrmB and CsdA by means of Far Western analysis (Raynal et al . , 
1999) . Among them CsdA is a cold shock-inducible protein and 
has an activity to unwind double-stranded RNA in the absence of 
ATP (Jones et al . , 1996). Similar to RbfA, CsdA is essential 
for growth at low temperature (data not shown) . CsdA function 
was proposed to be essential for ribosome function to increase 
translational efficiencies of mRNAs by unwinding stable 
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secondary structures formed at low temperature (Jones et al., 
1996) . 

[0087] A csdA deletion mutant, in which cold-shock gene 
expression is prolonged similar to that in pnp and rbfA deletion 
strains, was constructed. The kanamycin resistance gene (1.3K 
Hindi fragment) from pUC7Km(Pst) was inserted into the middle 
of the coding region of the csdA gene (alternatively deadD and 
mssB) at the EcoRV site in pKX164, yielding pKNJ9026. The 
linearized pKNJ902 6 DNA fragment containing the disrupted csdA 
(csdA: : Kan) was then introduced into the chromosome of a recD 
mutant FS1576. Kanamycin-resistant transf ormants were isolated 
and the disruption of csdA on the chromosome was confirmed by 
Southern hybridization. The csdA: : Kan mutation was transduced 
into the wild-type strain MC4100, yielding KNJ130. However, the 
csdA deletion may not assist the expression of a cloned gene as 
predicted in the case of pnp and rbfA deletions. This is because 
CsdA may be required for unwinding of mRNAs for non-cold-shock 
proteins. As our cold-shock-vector system will be used for non- 
cold-shock proteins, CsdA in the host system may even help the 
expression of the cloned gene. Note that in contrast to CsdA, 
RbfA is likely not to be required for the cloned gene expression 
with our cold-shock vectors as the elements required for 
translation initiation are from cspA. 

[0088] On the basis of these considerations, the effects of 
not only csdA deletion but also CsdA overexpression on the 
expression of a number of genes cloned in our vector will be 
examined. The overexpression of CsdA will be carried out by 
cloning these genes in a pINIII vector (Inouye, 1983) and will 
be cotransf ormed in the cells with the cold-shock vectors. Using 
pINIII vector, CsdA can be induced with IPTG before cold-shock, 
and its effect on the cold-shock induction of the expression of 
the gene of interest can be evaluated. 
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(iv) Standardization of growth medium conditions 

[0089] The goal of this study is to synthesize a protein of 
interest at a level of 90-95% of total cellular protein 
synthesis with a protein yield of higher than 60-70% of the 
total cellular proteins. Thus cell lysates after removal of 
insoluble materials such as the membrane fraction and the 
ribosomal fraction by centrif ugation may be used for NMR 
spectroscopy without any further purification. In such cases, 
even whole cell suspensions without lysis could be applied for 
NMR spectroscopy. Therefore, by exchanging the medium after cold 
shock (for example from a 12 C-glucose/ 14 NH 4 Cl-base medium to a 
13 C-glucose/ 15 NH 4 Cl-base medium) , only the protein of interest 
can be labeled with isotopes, eliminating the purification 
process. Assuming that a protein of a molecular weight of 20 kDa 
is produced at a level of 60% of the total cellular protein 
(note that only this protein is newly synthesized upon cold 
shock) , the total yield of the protein per liter is expected to 
be approximately 120 mg. If this protein is produced in a 
soluble form and. the cell lysate is prepared in 4 ml from the 
culture (sonicated and subsequently ultracentrif uged to remove 
cells debris, the membrane fraction and the ribosomal fraction) 
one can obtain 30 mg/ml or 1.5 mM solution of the protein of 
interest, which can be directly used for NMR spectroscopy. As 
NMR spectroscopy can be done with less than 1 ml, the culture 
size now can be reduced to 250 ml, which cuts the usage of 
expensive 13 C-glucose and 15 N-NH 4 Cl[or ( 15 NH 4 ) 2 S0 4 ] to a quarter. 
Amino acids labeled with 15 N, 13 C or both can be used singly or 
in combination. Furthermore, as the cells are unable to grow 
upon cold shock under the proposed conditions, the overall 
carbon utilization should be much less than that of the growing 
cells. Therefore, the concentration of glucose in the medium 
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could be less than the concentration usually used for 13 C~ 
glucose (0.2%). This further saves the consumption of the 
expensive 13 C-glucose . 

[0090] In order to determine how much glucose is converted 
during cold-shock incubation of cells affected by the LACE 
effect, the effect of glucose concentrations (0.2%, 0.15%, 0.1%, 
0.05% and 0.025%) on the protein production after cold shock 
will be tested. Cells harboring the expression system described 
Example 1 will be tested. Before cold-shock, cells (10 ml 
cultures) will be collected by centrif ugation, washed with M9 
medium, and resuspended in 10 ml of M9 medium containing 
different concentrations of glucose. These cultures together 
with a control culture (no centrif ugation; a total of 6 
cultures) will be incubated at 15°C and the protein production 
will be examined every 24 h for 5 days. In this fashion, the 
minimum requirement of glucose in the medium will be determined 
without reducing the protein yield. After determining the 
glucose concentration, that condition will be applied for the 
few clones available, which produce soluble proteins in a high 
yield to label them with 13 C and 15 N . NMR spectroscopy on these 
proteins using cell lysate will be carried out and the 
effectiveness of the cold-shock vectors will be evaluated. 

EXAMPLE 5 

NMR Spectroscopy without Protein Purification 

[0091] NMR spectroscopy analysis [ 1 H~ 15 N) HSQC] was performed 
using supernatant from whole cell lysate without protein 
purification. For this experiment, the ATP-binding domain of 
EnvZ (164 residues), an osmosensing histidine kinase in E. coll 
was used as a model system because its NMR solution structure 
has already been resolved in a collaboration between the 
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laboratory of the present inventors and Dr. Ikura of the Ontario 
Cancer Institute (Nature 386:88-92, 1998). 

[0092] A 500 ml culture of E . coll BL21 harboring pCold-EnvZ 
(ATP-binding domain or fragment B) in M9 minimal medium was 
grown to OD 6 oonm of 0.6. Cells were collected by centrif ugation 
and resuspended in 250 ml prechilled M9 minimal medium 
containing 0.1% 15 NH 4 C1 and 0.4% glucose. After the resuspended 
cells were grown for 48 hrs in 15°C, the cells were harvested by 
centrif ugation and lysed by repeated freeze-thaw and sonication. 
After separating the cell debris and the membrane fraction 
(pellet) by ultracentrifugation, the resultant supernatant was 
directly used for NMR analysis. 

[0093] Fig. 14A shows a SDS-PAGE gel of the resultant 
supernatant after ultracentrif ugation stained with Coomassie 
Brilliant Blue. The EnvZ ATP-binding domain in Fig. 14A 
represents almost 80% of the total protein sample. Of 
particular note, most of the minor bands seen in the gel in Fig. 
14A are those proteins carried over from the cells at 37°C f and 
therefore, are not labeled with 15 N. Fig. 14B shows an SDS-PAGE 
gel of total cell lysate, and the resultant pellet and 
supernatant after ultracentrif ugation . 

[0094] A distinct ( 1 H~ 15 N) HSQC NMR spectrum, comparable to 
the spectrum of purified EnvZ ATP-binding domain (Fig. 15B) was 
obtained from the supernatant of total cell lysate without any 
protein purification procedures (Fig. 15A) . This approach of 
using the cold shock vector system to achieve exclusive labeling 
of a single membrane protein with 15 N and/or 13 C allows 
determination of the structure of heterologous proteins and 
membrane proteins In situ without protein purification. 

[0095] Having now fully described this invention, it will be 
appreciated by those skilled in the art that the same can be 
performed within a wide range of equivalent parameters, 



BNSDOCID: <WO 03074657A2_I_> 



WO 03/074657 



- 44 - 



PCT/US03/05531 



concentrations, and conditions without departing from the spirit 
and scope of the invention and without undue experimentation. 

[0096] While this invention has been described in connection 
with specific embodiments thereof, it will be understood that it 
is capable of further modifications. This application is 
intended to cover any variations, uses, or adaptations of the 
inventions following, in general, the principles of the 
invention and including such departures from the present 
disclosure as come within known or customary practice within the 
art to which the invention pertains and as may be applied to the 
essential features hereinbefore set forth as follows in the 
scope of the appended claims. 

[0097] All references cited herein, including journal 
articles or abstracts, published or corresponding U.S. or 
foreign patent applications, issued U.S. or foreign patents, or 
any other references, are entirely incorporated by reference 
herein, including all data, tables, figures, and text presented 
in the cited references. Additionally, the entire contents of 
the references cited within the references cited herein are also 
entirely incorporated by references. 

[0098] Reference to known method steps, conventional methods 
steps, known methods or conventional methods is not in any way 
an admission that any aspect, description or embodiment of the 
present invention is disclosed, taught or suggested in the 
relevant art . 

[0099] The foregoing description of the specific embodiments 
will so fully reveal the general nature of the invention that 
others can, by applying knowledge within the skill of the art 
(including the contents of the references cited herein) , readily 
modify and/or adapt for various applications such specific 
embodiments, without undue experimentation, without departing 
from the general concept of the present invention. Therefore, 
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such adaptations and modifications are intended to be within the 
meaning and range of equivalents of the disclosed embodiments, 
based on the teaching and guidance presented herein. It is to 
be understood that the phraseology or terminology herein is for 
the purpose of description and not of limitation, such that the 
terminology or phraseology of the present specification is to be 
interpreted by the skilled artisan in light of the teachings and 
guidance presented herein, in combination with the knowledge of 
one of ordinary skill in the art. 
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WHAT IS CLAIMED IS: 

1. A DNA molecule capable of expressing a 
heterologous polypeptide , comprising: 

a nucleotide sequence encoding a heterologous 
polypeptide; and 

a cold shock inducible promoter and 5 1 -untranslated 
region (5 f -UTR) operably linked to said nucleotide sequence 
encoding said heterologous polypeptide to control the expression 
and production of said heterologous polypeptide under conditions 
that elicit a cold shock response, 

wherein either (1) a translation enhancing element 
comprising an AT rich sequence of 12 nucleotides at about the 
4 th - 7 th codons downstream from the initiation codon is disposed 
between said cold shock inducible promoter and 5 1 -UTR and said 
nucleotide sequence encoding said heterologous polypeptide to 
form a translational in-frame fusion with said nucleotide 
sequence encoding said heterologous polypeptide or (2) said 
nucleotide sequence encoding said heterologous polypeptide 
comprises an AT rich sequence in the 4 th - 7 th codons downstream 
from the initiation codon. 

2. The DNA molecule of claim 1, wherein said cold 
shock inducible promoter and 5 1 -UTR is selected from the group 
consisting of nucleotides 22-441 of SEQ ID NO:l, nucleotides 
11-214 of SEQ ID NO: 3, nucleotides 12-213 of SEQ ID N0:5, 
nucleotides 11-201 of SEQ ID NO: 7, fragments thereof, and 
variants thereof. 

3. The DNA molecule of claim 1, wherein (1) is 

present . 

4. The DNA molecule of claim 3, wherein said cold 
shock inducible promoter and 5 1 -UTR is linked to said nucleotide 
sequence encoding said heterologous polypeptide by said 
translation enhancing element consisting of a nucleotide 
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sequence of seven codons, wherein the first codon is the 
initiation codon and the 4 th - 7 th codons form said AT rich 
sequence . 

5. The DNA molecule of claim 4, wherein said 
translation enhancing element comprises nucleotides 1-18 of SEQ 
ID NO:9. 

6. The DNA molecule of claim 1, wherein said 
nucleotide sequence encoding said heterologous polypeptide 
comprises an AT rich sequence in the 4 th - 7 th codons downstream 
from the initiation codon. 

7. The DNA molecule of claim 1, further comprising a 
3 f -untranslated region (3 f -UTR) of a cold shock inducible gene 
disposed downstream from the nucleotide sequence encoding said 
heterologous polypeptide . 

8. The DNA molecule of claim 7 , wherein said 3 1 -UTR 
is selected from the 3 1 -UTR of the group consisting of cspA, 
cspB, cspG, and cspl . 

9. The DNA molecule of claim 7, wherein said 3 r -UTR 
comprises nucleotides 442-539 of SEQ ID NO:l. 

10. The DNA molecule of claim 1, wherein said cold 
shock inducible promoter and 5 1 -UTR region sequence comprises a 
lacl operator sequence immediately downstream of the 
transcription initiation site. 

11. The DNA molecule of claim 10, wherein the lacl 
operator sequence is the nucleotide sequence of SEQ ID NO: 10. 

12. The DNA molecule of claim 1, which is a vector. 

13. The vector of claim 12, which is a self- 
replicating vector . 

14. The vector of claim 13, which is pColdl . 

15. The vector of claim 13, which is pColdll. 

16. The vector of claim 13, which is pColdlll. 
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17. A prokaryotic host cell transformed with the DNA 
molecule of claim 1. 

18. The host cell of claim 17, which is Escherichia 

coli . 

19. The host cell of claim 17, which is a 
polynucleotide phosphorylase mutant lacking polynucleotide 
phosphorylase activity. 

20. The host cell of claim 17, which is a rbfA mutant 
lacking the 15 kDa RbfA protein that associates with free 30S 
ribosomal subunit but not the 7 0S ribosomal subunit. 

21. The host cell of claim 17, which is a csdA RNA 
helicase mutant lacking csdA ■ RNA helicase activity. 

22. The host cell of claim 17, which is co- 
transformed with a vector which overexpresses csdA RNA helicase 
in said host cell. 

23. A method for producing a heterologous 
polypeptide, comprising : 

culturing the host cell of claim 17 in a nutrient 
medium at the normal physiological growth temperature of the 
host cell; 

subjecting the cultured host cell to cold shock by 
lowering the incubation temperature to a temperature at least 
13°C below the normal physiological growth temperature of the 
host cell to induce production of the heterologous polypeptide; 
and 

incubating the cold shocked host cell at the lowered 
incubation temperature to produce the heterologous polypeptide. 

24. The method of claim 23, further comprising the 
step of recovering the produced heterologous polypeptide. 

25. The method of claim 23, wherein the incubation 
temperature is lowered to at least 20°C below the normal 
physiological growth temperature of the host cell. 
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26. The method of claim 23 , further comprising the 
step of exchanging the nutrient medium with a medium containing 
a compound for detectably labeling the heterologous polypeptide 
after subjecting the cultured host cell to cold shock. 

27. The method of claim 26, wherein said compound 
contains an isotope normally found in negligible amounts in the 
host cell. 

28. The method of claim 27, wherein said isotope is 

15 N or 13 C. 

29. The method of claim 27, wherein said compound is 
selected from the group consisting of 15 NH 4 C1, ( 15 NH 4 )2S04, 13 C- 
glucose, amino acid residues labeled with 15 N, amino acid 
residues labeled with 13 C, and amino acid residues labeled with 
both 15 N and 13 C. 

30. The method of claim 27, wherein the host cell is 
a mutant lacking one or more of the major cold shock proteins. 

31. The method of claim 30, wherein said mutant host 
cell lacks CspA. 

32. The method of claim 30, wherein said mutant host 
cell lacks' CspA, CspB, CspG, and CspE. 

33. A metfiod for obtaining a NMR spectrum of a 
heterologous polypeptide, comprising: 

isolating the cold shocked host cell incubated in the 
method of claim 2 6 at the lowered incubation temperature to 
produce the heterologous polypeptide detectably labeled with 
said compound; and 

conducting either whole cell NMR spectroscopy on the 
isolated cold shocked host cell or NMR spectroscopy on 
supernatant from cell lysate of the isolated cold shocked host 
cell to obtain a NMR spectrum of the heterologous polypeptide. 



BNSDOCID: <WO 03074657A2_I_> 



WO 03/074657 



- 54 - 



PCT/US03/05531 



34. The method of claim 33, wherein whole cell NMR 
spectroscopy on the isolated cold shocked whole cell is 
conducted. 

35. The method claim 33 , wherein NMR spectroscopy on 
supernatant from cell lysate of the isolated cold shocked host 
cell is conducted. 

36. A method for obtaining a NMR spectrum of a 
heterologous polypeptide, comprising : 

culturing in a nutrient medium a prokaryotic host cell 
transformed with a nucleic acid comprising a nucleotide sequence 
encoding a heterologous polypeptide and capable of expressing 
and producing the heterologous polypeptide in response to cold 
shock; 

subjecting the cultured host cell to cold shock by 
lowering the incubation temperature to a temperature below the 
normal physiological growth temperature of the host cell to 
induce the production of the heterologous polypeptide; 

exchanging the nutrient medium with a medium 
containing a compound for detectably labeling the heterologous 
polypeptide after subjecting the cultured host cell to cold 
shock; 

incubating the cold shocked host cell at the lowered 
incubation temperature to produce the heterologous polypeptide 
detectably labeled with said compound; 

isolating the cold shocked host cell incubated at the 
lowered incubation temperature; and 

conducting either whole cell NMR spectroscopy on the 
isolated cold shocked host cell or NMR spectroscopy on 
supernatant from cell lysate of the isolated cold shocked host 
cell to obtain a NMR spectrum of the heterologous polypeptide. 
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SD 

Wt CGCCGAAAGGCACACUUAAUUAUUAAAGGUAAUACACUAUGUCC 

1 CGCCGAAAGGC AC AC C _ J AAGGU AAUACACU AUG UCC 

2 CGCCGAAAGGCACACUUAAUUDUU AAAGGU AAUACACU AUGU CC 

3 CGCCGAAAGGCACACUUGAUUAUU AAAGG UAAUACAC UAUG UCC 

4 CGCCGAAAGGCACACUUAACUAUU AAAGGU AAUACACU AUGU CC 

5 CGCCGAAAGGCACACUUGACUAUU AAAGG UAAUACAC UAUG UCC 
FIG . 1 OA 6 CGCCGAAAGGCACACUUAAUUGUP AAAGGU AAUACAC UAUG UCC 

7 CGCCGAAAGGCACACUTJAACUGUU AAAGGU AAUACACU AUGU CC 

8 CGCCGAAAGGCACACUUGACUGUU AAAGGU AAUACAC UAUGU CC 

9 CGCCGAAAGGCACACCZ3AUUAUU AAAGGU AAUACACU AUG UCC 

1 0 CGCCGAAAGGCACACUUAIZZZJAUU AAAGG UAAUACACU AUGU CC 

1 1 CGCCGAAAGGCACACUUAAUUCZZZ 3AAGGU AAUACAC UAUGU CC 

1 2 CGCCGAAAGGC AC ACUUAAUUAUUAAACCUAAUAC ACUAUGUCC 

1 3 CGCCGAAAGGCACACt ZZ3 AACCUAAUAC ACUAUGUCC 
FIG . 1 OB 14 CGCCGAAAGGCACACUUAAUUAUUACZICCUAAUACACU AUG UCC 

1 5 CGCCGAAAGGCAC ACUUAAUUAUUAEZ=ZUAAUAC ACUAUGUCC 

16 CGCCGAAAGGCACACrTTTAATTTTATTTT A t I CACUAUGUCC 
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Insertion sequence for pColdl 



aaccgattaatcataaatatgaaaaataattgttgcatcacccgccaatgcgtggcttaatgcacatca 
cspA promoter 

acggtttgacgtacagaccattaaagcagtgtagtaaggcaagtcccttcaagagttatcgttgataccc 
ctcgtagtgcacattcctttaacgcttcaaaatctgtaaagcacgccatatcgccgaaaggcacacttaa 
ttattaaaggtaatacact 

cspA 5'UTR 

atgtccggtaaaatgactggtatcgtaaaatggttcaacgctgacaaaggcttcggcttcatcactcctga 
cgatggctctaaagatgtgttcgtacacttctctgctatccagaacgatggttacaaatctctggacgaag 
gtcagaaagtgtccttcaccatcgaaagcggcgctaaaggcccggcagctggtaacgtaaccagcctg 
cspA coding region 

gtcgac catcatcatcatcatcat atcgaaggtagg catatgaagcttggtaccggatcc 
His tag factor Xa site Ndel BamHl 

tctctgcttaaaagcacagaatctaagatccctgccatttggcggggatttttttatttgttttcaggaaataaat 
aatcgatcgcgtaataaaatct 

cspA3'UTR 



FIG. 12A 
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Insertion sequence for pColdll 



aaccgattaatcataaatatgaaaaataattgttgcatcacccgccaatgcgtggcttaatgcacatca 

cspA promoter 

acggtttgacgtacagaccattaaagcagtgtagtaaggcaagtcccttcaagagttatcgttgataccc 
ctcgtagtgcacattcctttaacgcttcaaaatctgtaaagcacgccatatcgccgaaaggcacacttaa 
ttattaaaggtaatacact 

cspA 5'UTR 



atgaatcacaaagtg catatg aagcttggtacc ggatcc 
DB ~ Ndel " " Bam HI 

tctctgcttaaaagcacagaatctaagatccctgccatttggcggggatttttttatttgttttcaggaaataaat 
aatcgatcgcgtaataaaatct 

cspA3 r UTR 



FIG. 12B 
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Insertion sequence for pColdlll 



aaccgattaatcataaatatgaaaaataattgttgcatcacccgccaatgcgtggcttaatgcacatca 

cspA promoter 

acggtttgacgtacagaccattaaagcagtgtagtaaggcaagtcccttcaagagttatcgttgataccc 
ctcgtagtgcacattcctttaacgcttcaaaatctgtaaagcacgccatatcgccgaaaggcacacttaa 
ttattaaaggtaatacact 

cspA 5 ? UTR 

cat atg aa gc ttg g tac c ggatcc 
Ndel " BamHI 

tctctgcttaaaagcacagaatctaagatccctgccatttggcggggatttttttatttgttttcaggaaataaat 
aatcgatcgcgtaataaaatct 

cspAS'UTR 
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cspl 

CSpG 



cspA 



-10 

TTTTCTTTA<3^AAAGTAAlfcCtrTGCT — CT 



GCCGGACGG. T 
GC.GGA.GT.r. 
CA.AAA.ATC 




.T\ATC-CAA.T -AGC.GC-G.C 

T?ATA. .T.C. .CT -.AT.GC-G.C 

.ATZACCC.CCA.TG.G. :GZ. . . . . .CACA.CAA-C 



+1 Cold Box 

5GTTTGTTACAAATTT \TCTGAAGCAGTCATTGT 

AA.G-C.GAC .G.AT-CAA. . .AG. .TA 

-AAGA.C.GAC 3ATAT.C.A. . .AG. .TA 
. ACGT . C . GAC — . AT-TAA . . CAG -GTA 



cspl 
CSpG 
cspB 
cspA 



TATAA- -TTTTATTATTTGTACCrrCTTGAGATTTCCTTGTTGGTTTT — TCTCTCT G A TA TTT -T-TT-TT-CGGACCATTC 

CTA. .GCAG . .C.CA-. . — . T . AGGCATT . TCC -C-T- . . .GAG CCT. .A-CAC. .ACT AG. . .C.G-.A. . AAA.C. .TG.T 

CTA. .GCAG. .C.C. .-. .C-.GG-G. .ATTCAC.TA. .CC.TC. . .GAG CAATT . AGT ACG . AGTCG . . .C.G. . A. GCAAAC . AT . .A 

GTA. GGCAAG . CCCT . C-AAGAG . TA . CGTTGA . A . CCC . C . TAG . GCACA.TC - TTTAACGCTTCA . AATCTG . AAAGC-AC . C . ATA. . 




SD 



Downstream 
Box 



VTTTCTTCAA — ftAGC rAATAAT-TATGTCTAACaU\*|ft!IX^ 

— AA. .TG .... \.G. . .AA. T. 

A.GT1 \. - .GTAGA .A. .T. 

AC|.C. . AA.T.TT^ . . . , T.A. . CAC .CGGT. 



.A.C. 
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EnvZ ATP binding domain 



FIG. 14A 



1 



5 6 7 8 



fragment B 
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NMR without protein purification 



¥ — 103 
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NMR with purified protein 



'418 



HSQC of Apo-EnvzB 
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SEQUENCE LISTING 



<110> INOUYE, Masayori 
XIA, Bing 
PHADTARE, Sangita 
Qing, Guoliang 
KE, Haiping 

<120> COLD SHOCK INDUCIBLE EXPRESSION AND PRODUCTION OF HETEROLOGOUS 
POLYPEPTIDES 

<130> INOUYE=2.1 PCT 

<150> 60/361,069 
<151> 2002-03-01 

<150> 60/402,921 
<151> 2002-08-14 

<160> 50 

<170> Patentln version 3,2 

<210> 1 

<211> 539 

<212> DNA 

<213> Escherichia coli 



<220> 

<221> CDS 

<222> (229) . . (441) 

<400> 1 



aaccgattaa 


tcataaatat 


gaaaaataat 


tgttgcatca 


cccgccaatg 


cgtggcttaa 


60 


tgcacatcaa 


cggtttgacg 


tacagaccat 


taaagcagtg 


tagtaaggca 


agtcccttca 


120 


agagttatcg 


ttgatacccc 


tcgtagtgca 


cattccttta 


acgcttcaaa 


atctgtaaag 


180 


cacgccatat 


cgccgaaagg 


cacacttaat 


tattaaaggt 


aatacact atg tec ggt 


237 



Met Ser Gly 
1 

aaa atg act ggt ate gta aaa tgg ttc aac get gac aaa ggc ttc ggc 285 
Lys Met Thr Gly lie Val Lys Trp Phe Asn Ala Asp Lys Gly Phe Gly 
5 10 15 

ttc ate act cct gac gat ggc tct aaa gat gtg ttc gta cac ttc tct 333 
Phe lie Thr Pro Asp Asp Gly Ser Lys Asp Val Phe Val His Phe Ser 
20 25 4 ^ 30 35 

get ate cag aac gat ggt tac aaa tct ctg gac gaa ggt cag aaa gtg 381 
Ala lie Gin Asn Asp Gly Tyr Lys Ser Leu Asp Glu Gly Gin Lys Val 
40 45 50 

tec ttc ace ate gaa age ggc get aaa ggc ccg gca get ggt ■ aac gta 429 
Ser Phe Thr lie Glu Ser Gly Ala Lys Gly Pro Ala Ala Gly Asn Val 
55 60 65 

acc age ctg taa tetctgetta aaagcacaga atctaagatc cctgccattt 4 81 

Thr Ser Leu 
70 
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ggcggggatt tttttatttg ttttcaggaa ataaataatc 'gatcgcgtaa taaaatct 539 

<210> 2 

<211> 70 

<212> PRT 

<213> Escherichia coli 

<400> 2 

Met Ser Gly Lys Met Thr Gly lie Val Lys Trp Phe Asn Ala Asp Lys 
1 5 10 15 

Gly Phe Gly Phe lie Thr Pro Asp Asp Gly Ser Lys Asp Val Phe Val 
20 25 ~ 30 

His Phe Ser Ala lie Gin Asn Asp Gly Tyr Lys Ser Leu Asp Glu Gly 
35 40 45 

Gin Lys Val Ser Phe Thr lie Glu Ser Gly Ala Lys Gly Pro Ala Ala 
50 55 60 

Gly Asn Val Thr Ser Leu 
65 70 

<210> 3 

<211> 563 

<212> DNA 

<213> Escherichia coli 
<220> 

<221> CDS 

<222> (215) . . (430) 

<400> 3 

gctggatgtc taaaataaac attgcttcat atgttcaact atgcgttaat gattgcgtcg 60 

gtttgaagaa cagacgatat acgaagtagt ttactaaagc agttctcatt tcaggtgtta 120 

ttcacttatt ccttctttga gtctctccaa ttaagtacga agtcgtttct gttatgcaaa 180 

ccatttatgc cgaaaggctc aagttaagga atgt aga atg tea aat aaa atg act 235 

Arg Met Ser Asn Lys Met Thr 
1 5 

ggt tta gta aaa tgg ttt aac get gat aaa ggt ttc ggc ttt att tct- 283 
Gly Leu Val Lys Trp Phe Asn Ala Asp Lys Gly Phe Gly Phe lie Ser 
10 15 " 20 

cct gtt gat ggt agt aaa gat gtg ttt gtg cat ttt tct gcg att cag 331 
Pro Val Asp Gly Ser Lys Asp Val Phe Val His Phe Ser Ala He Gin 
25 30 35 

aat gat aat tat cga acc tta ttt gaa ggt caa aag gtt acc ttc tct 37 9 

Asn Asp Asn Tyr Arg Thr Leu Phe Glu Gly Gin Lys Val Thr Phe Ser 
40 " 45 50 55 
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ata gag agt ggt get aaa ggt cct gca gca gca aat gtc ate att act 427 
He Glu Ser Gly Ala Lys Gly Pro Ala Ala Ala Asn Val He lie Thr 
60 65 70 

gat taaaattcat cgctcgtctg tatacgataa cgaagaaggc tgatgcctga 480 
Asp 

gtagagatac ggacagagta gtgaatattg gatctcttta ataaaaagta aggaggtcca 54 0 

atacatgaaa caatggctag cat 5 63 

<210> 4 
<211> 72 
<212> PRT 

<213> Escherichia coli 
<220> 

<223> Synthetic 
<400> 4 

Arg Met Sef Asn Lys Met Thr Gly Leu Val Lys Trp Phe Asn Ala Asp 
15 10 15 

Lys Gly Phe Gly Phe He Ser Pro Val Asp Gly Ser Lys Asp Val Phe 
20 25 30 

Val His Phe Ser Ala lie Gin Asn Asp Asn Tyr Arg Thr Leu Phe Glu 
35 4 0 * 45 

Gly Gin Lys Val Thr Phe Ser lie Glu Ser Gly Ala Lys Gly Pro Ala 
50 55 60 

Ala Ala Asn Val He He Thr Asp 
65 70 

<210> 5 

<211> 520 

<212> DNA 

<213> Escherichia coli 
<220> 

<221> CDS 

<222> (214) . . (426) 

<400> 5 

cgccggacgg ctaaaataaa atttgettaa tctcaattat catgcgttaa tagctgegtc 60 

ggtttgaaag acagacagca tacaaagtag tttactaaag cagttctcat tatcaggcat 120 

tatccccttc ttttgagtct ctctcctgaa cactaagtag tttctgtatt aaagccctgt 180 

ttgccgaaag geccaaaatg aaggaagtaa aat atg tct aat aaa atg act ggt 234 

Met Ser Asn Lys Met Thr Gly 
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tta gta aaa tgg ttt aac gca gat aaa ggt ttt ggc ttt ate act cct 282 
Leu Val Lys Trp Phe Asn Ala Asp Lys Gly Phe Gly Phe lie Thr Pro 
10 15 20 

gat gat ggc age aaa gac gtt ttc gtc cat ttc acc gec ate cag age 330 
Asp Asp Gly Ser Lys Asp Val Phe Val His Phe Thr Ala lie Gin Ser 

25 30 35 ' 

aat gaa ttc cgc acg ctg aac gaa aat cag aaa gtt gaa ttt tct att 378 
Asn Glu Phe Arg Thr Leu Asn Glu Asn Gin Lys Val Glu Phe Ser He 
40 45 50 55 

gag cag ggg caa cgt ggc ccc gcg gca gcg aac gtt gtt acg etc taa 426 
Glu Gin Gly Gin Arg Gly Pro Ala Ala Ala Asn Val Val Thr Leu 
60 65 70 

ggttgccatt attactcaac atctccattt ccgctgtcca tgttgtcatg gttcacagta 486 

ccgcacatcg gcattcgatg tgacggagcg aaac 520 

<210> 6 
<211> 70 
<212> PRT 

<213> Escherichia coli 
<220> 

<223> Synthetic 
<400> 6 

Met Ser Asn Lys Met Thr Gly Leu Val Lys Trp Phe Asn Ala Asp Lys 
15 10 15 

Gly Phe Gly Phe He Thr Pro Asp Asp Gly Ser Lys Asp Val Phe Val 
20 25 30 

His Phe Thr Ala He Gin Ser Asn Glu Phe Arg Thr Leu Asn Glu Asn 
35 40 45 

Gin Lys Val Glu Phe Ser He Glu Gin Gly Gin Arg Gly Pro Ala Ala 
50 55 60 

Ala Asn Val Val Thr Leu 
65 70 

<210> 7 

<211> 560 

<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (202) . . (414) 
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<400> 7 

ttttctttac aaaagtaatc cttgctatgg gtggttaatc atgcgttaat ggtgttctgg 60 

tttgttacaa atttatctga agcagtcatt gttataattt tattatttgt acctcttgag 120 

atttccttgt tggtttttct ctctgatatt ttttttcgga ccattctgcc caagggctaa 



tttcttcaaa aggtaataat t atg tct aac aaa atg act ggt tta gtg aaa 

Met Ser Asn Lys Met Thr Gly Leu Val Lys 
15 10 

tgg tgt aac cct gaa aaa ggt ttt ggt ttc ate acg ccg aaa gag ggc 
Trp Cys Asn Pro Glu Lys Gly Phe Gly Phe lie Thr Pro Lys Glu Gly 
15 ~ 20 25 

age aaa gat gtg ttt gtc cat ttc tea gca atg cag age aac gat ttc 
Ser Lys Asp Val Phe Val His Phe Ser Ala Met Gin Ser Asn Asp Phe 
30 35 40 

aaa aca tta act gag aat cag gaa gtt gaa ttt ggt att gag aac gga 375 
Lys Thr Leu Thr Glu Asn Gin Glu Val Glu Phe Gly lie Glu Asn Gly 
45 50 55 



cct aaa ggt cct gee get gtt cat gta gtg gcg ctt tga ggtagacaat 
Pro Lys Gly Pro Ala Ala Val His Val Val Ala Leu 
60 65 70 



<210> 


8 


<211> 


70 


<212> 


PRT 


<213> 


Escherichia coli 


<220> 




<223> 


Synthetic 


<400> 


8 


Met Ser Asn Lys Met Thr 


1 


5 



10 15 

Gly Phe Gly Phe lie Thr Pro Lys Glu Gly Ser Lys Asp Val Phe Val 
20 25 30 

His Phe Ser Ala Met Gin Ser Asn Asp Phe Lys Thr Leu Thr Glu Asn 
35 40 45 

Gin Glu Val Glu Phe Gly lie Glu Asn Gly Pro Lys Gly Pro Ala Ala 
50 55 60 



Val His Val Val Ala Leu 
65 70 



180 
231 

279 

327 



424 



attacaaacc atattcactt tagatgeccg tgttgtcatg gttcccagta tagaacatca 484 
tcttttgatg tttctgacat gaatcctttc ggggcaaaat gtatcttttg taaatcaatg 54 4 
atgattacat ttgata 



560 
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<210> 9 

<211> 21 

<212> DNA 

<213> Escherichia coli 



<4 00> 9 

atgaatcaca aagtgcatat g 21 

<210> 10 
<211> 26 
<212> DNA 

<213> Escherichia coli 
<400> 10 

attgtgagcg gataacaatt gatgtg 2 6 



<210> 11 

<211> 44 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 



Sequence 



<220> 

<221> mis cofeature 

<222> (33).. (44) 

<223> n is a, c, g or t. 

<400> 11 

ggatctagag ggtattaata atgactggtg cannnnnnnn nnnn 4 4 



<210> 12 

<211> 24 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 



Sequence 



<220> 

<221> misc_feature 

<222> (13) . . (24) 

<223> n is a f c, g or t. 

<400> 12 

atgactggtg cannnnnnnn nnnn 



<210> 13 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 



Sequence 
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<220> 

<221> misc_feature 

<222> (5).. (16) 

<223> n is a, c, g or t. 

<400> 13 

tgcannnnnn nnnnnnccaa 20 



<210> 14 

<211> 12 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 



<210> 15 

<211> 12 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 15 

catttatata ta 12 



<210> 16 

<211> 12 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 



<210> 17 

<211> 12 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 17 

acttttacaa ag 12 



<210> 18 

<211> 12 

<212> DNA 

<213> Artificial Sequence 



<400> 14 
atttatatat at 



12 



<400> 16 
tttatatata ta 



12 
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<220> 

<223> Synthetic 



<400> 18 
cacttttaca aa 



12 



<210> 19 

<211> 12 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 19 

cttttacaaa ga 12 



<210> 20 

<211> 12 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 



<210> 21 

<211> 12 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 21 

cacacatgaa ca 12 

<210> 22 

<211> 12 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 



<210> 23 

<211> 12 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 



<400> 20 
acacatgaac ac 



12 



<400> 22 
cacatgaaca ca 



12 



Page 8 



BNSDOCID: <WO 03074657A2_I_> 



WO 03/074657 



PCT/US03/05531 



<400> 23 

catagttttc aa 12 



<210> 24 

<211> 12 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 24 
ccatagtttt ca 



<210> 25 

<211> 12 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 25 
atagttttca aa 



<210> 26 

<211> 12 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 26 
cggtctctcc gc 

<210> 27 

<211> 9 

<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 27 
uuaacuuua 



<210> 28 

<211> 44 

<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 



<400> 28 

cgccgaaagg cacacuuaau uauuaaaggu aauacacuau gucc 



44 
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<210> 29 

<211> 34 

<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 29 

cgccgaaagg cacacaaggu aauacacuau gucc 34 

<210> 30 

<211> 43 

<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 



<210> 31 

<211> 44 

<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 31 

cgccgaaagg cacacuugau uauuaaaggu aauacacuau gucc 4 4 

<210> 32 

<211> 44 

<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 



<210> 33 

<211> 44 

<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 33 

cgccgaaagg cacacuugac uauuaaaggu aauacacuau gucc 44 



<400> 30 

cgccgaaagg cacacuuaau uuuaaaggua auacacuaug ucc 



43 



<400> 32 

cgccgaaagg cacacuuaac uauuaaaggu aauacacuau gucc 



44 



<210> 34 
<211> 44 
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<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 34 

cgccgaaagg cacacuuaau uguuaaaggu aauacacuau gucc 44 

<210>. 35 

<211> 44 

<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 



<210> 36 

<211> 44 

<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 36 

cgccgaaagg cacacuugac uguuaaaggu aauacacuau gucc 4 4 

<210> 37 

<211> 41 

<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 



<210> 38 

<211> 41 

<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 38 

cgccgaaagg cacacuuaau uaaagguaau acacuauguc c 41 

<210> 39 

<211> 40 

<212^ RNA 

<213> Artificial Sequence 



<400> 35 

cgccgaaagg cacacuuaac uguuaaaggu aauacacuau gucc 



44 



<400> 37 

cgccgaaagg cacacauuau uaaagguaau acacuauguc c 



41 
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<220> 

<223> Synthetic 



<400> 39 

cgccgaaagg cacacuuaau uaagguaaua cacuaugucc 



40 



<210> 40 

<211> 44 

<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 40 

cgccgaaagg cacacuuaau uauuaaaccu aauacacuau gucc 4 4 

<210> 41 

<211> 34 

<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 



<210> 42 

<211> 42 

<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 42 

cgccgaaagg cacacuuaau uauuaccuaa uacacuaugu cc 4 2 

<210> 43 

<211> 39 

<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 



<210> 44 

<211> 35 

<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 



<400> 41 

cgccgaaagg cacacaaccu aauacacuau gucc 



34 



<400> 43 

cgccgaaagg cacacuuaau uauauaauac acuaugucc 



39 
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<400> 44 
cgccgaaagg 


cacacuuaau 


uauuacacua 


ugucc 






35 


<210> 45 
<211> 596 
<212> DNA 

<213> Artificial Sequence 










<220> 

<223> Synthetic 












<400> 45 
aaccgattaa 


t cataaatat 


gaaaaataat 


tgttgcatca 


cccgccaatg 


cgtggcttaa 


60 


tgcacatcaa 


cggtttgacg 


tacagaccat 


taaagcagtg 


tagtaaggca 


agtcccttca 


120 


agagttatcg 


ttgatacccc 


tcgtagtgca 


cattccttta 


acgcttcaaa 


atctgtaaag 


180 


cacgccatat 


cgccgaaagg 


cacacttaat 


tattaaaggt 


aatacactat 


gtccggtaaa 


240 


atgactggta 


tcgtaaaatg 


gttcaacgct 


gacaaaggct 


tcggcttcat 


cactcctgac 


300 


gatggctcta 


aagatgtgtt 


cgtacacttc 


tctgctatcc 


agaacgatgg 


ttacaaatct 


360 


ctggacgaag 


gtcagaaagt 


gtccttcacc 


atcgaaagcg 


gcgctaaagg 


cccggcagct 


420 


ggtaacgtaa 


ccagcctggt 


cgaccatcat 


catcatcatc 


atatcgaagg 


taggcatatg 


480 


aagcttggta 


ccggatcctc 


tctgcttaaa 


agcacagaat 


ctaagatccc 


tgccatttgg 


540 


cggggatttt 


tttatttgtt 


ttcaggaaat 


aaataatcga 


tcgcgtaata 


aaatct 


596 


<210> 46 
<211> 365 
<212> DNA 

<213> Artificial Sequence 










<220> 

<223> Synthetic 












<400> 46 
aaccgattaa 


tcataaatat 


gaaaaataat 


tgttgcatca 


cccgccaatg 


cgtggcttaa 


60 


tgcacatcaa 


cggtttgacg 


tacagaccat 


taaagcagtg 


tagtaaggca 


agtcccttca 


120 


agagttatcg ttgatacccc 


tcgtagtgca 


cattccttta 


acgcttcaaa 


atctgtaaag 


180 


cacgccatat 


cgccgaaagg 


cacacttaat 


tattaaaggt 


aatacactat 


gaatcacaaa 


240 


gtgcatatga 


agcttggtac 


cggatcctct 


ctgcttaaaa 


gcacagaatc 


taagatccct 


300 


gccatttggc 


ggggattttt 


ttatttgttt 


tcaggaaata 


aataatcgat 


cgcgtaataa 


360 


aatct 












365 



<210> 47 

<211> 350 

<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Synthetic 
<400> 47 

aaccgattaa tcataaatat gaaaaataat tgttgcatca cccgccaatg cgtggcttaa 60 

tgcacatcaa cggtttgacg tacagaccat taaagcagtg tagtaaggca agtcccttca 120 

agagttatcg ttgatacccc tcgtagtgca cattccttta acgcttcaaa atctgtaaag 180 

cacgccatat cgccgaaagg cacacttaat tattaaaggt aatacactca tatgaagctt 24 0 

ggtaccggat cctctctgct taaaagcaca gaatctaaga tccctgccat ttggcgggga 300 

tttttttatt tgttttcagg aaataaataa tcgatcgcgt aataaaatct 350 



<210> 48 
<211> 162 
<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 
<400> 48 

acgguuugac guacagacca uuaaagcagu guaguaaggc aagucccuuc aagaguuauc 60 
guugauaccc cucguagugc acauuccuuu aacgcuucaa aaucuguaaa gcacgccaua 12 0 
ucgccgaaag gcacacuuaa uuauuaaagg uaauacacua ug 162 



<210> 49 

<211> 25 

<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 49 

ucccugccau uuggcgggga uuuuu 25 

<210> 50 

<211> 7 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic 

<400> 50 

Met Asn His Lys Val His Met 
1 5 



Page 14 



<WO 03074657A2_I_> 



(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property 
Organization 
International Bureau 

(43) International Publication Date 
12 September 2003 (12.09.2003) 




PCT 



lllllllllllllllllllllllll 

(10) International Publication Number 

WO 2003/074657 A3 



(51) International Patent Classification 7 : C12N 1/21, 

5/10, 15/63, C12P 21/02, G01V 3/14 

(21) International Application Number: 

PCT/US2003/005531 

(22) Internationa] Filing Date: 25 February 2003 (25.02.2003) 

(25) Filing Language: English 

(26) Publication Language: English 



(30) Priority Data: 

607361,069 
60/402,921 



1 March 2002 (01.03.2002) US 
14 August 2002 ( 14.08.2002 ) US 



(71) Applicant (for all designated States except US): TAKARA 
BIO, DSC. [JP/JP]; 4-1, Seta 3-chome, Otsu-shi, Shiga 
520-2193 (JP). 

(71) Applicant (for SD only): MCINNIS, Patricia [US/US]; 
2325 42nd Street, N.W., Apt. 203, Washington, D.C., DC 
20007 (US). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): INOUYE, Masay- 
ori [JP/US]; The Colony House, Apt. 16K, 1050 George 
Street, New Brunswick, NJ 08901 (US). PHADTARE, 
Sangita [IN/US]; 12-D Bartle Court, Highland Park, 
NJ 08904 (US). XIA, Bing [CN/US]; Apartment 9, 280 
Grove Street, Newton, MA 02466 (US). QING, Guoliang 



[CN/US]; 900 Davidson Road, 69 Nichols Apt., Piscat- 
away, NJ 08854 (US). KE, Haiping [CN/US]; 102-3G 
Montgomery Street, Highland Park, NJ 08904 (US). 

(74) Agents: BROWDY AND NEIMARK, P.L.L.C. et al.; 
624 Ninth Street N.W., Suite 300, Washington, DC 20001- 
5303 (US). 

(81) Designated States (national): AE, AG, AL, AM, AT, AU, 
AZ, BA, BB, BG, BR, BY, BZ, CA, CH, CN, CO, CR, CU, 
CZ, DE, DK, DM, DZ, EC, EE, ES, FI, GB, GD, GE, GH, 
GM, HR, HU, ID, IL, IN, IS, JP, KE, KG, KP, KR, KZ, LC, 
LK, LR, LS, LT, LU, LV, MA, MD, MG, MK, MN, MW, 
MX, MZ, NO, NZ, OM, PH, PL, PT, RO, RU, SC, SD, SE, 
SG, SK, SL, TJ, TM, TN, TR, TT, TZ, UA, UG, US, UZ, 
VC, VN, YU, ZA, ZM, ZW. 

(84) Designated States (regional): ARIPO patent (GH, GM, 
KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZM, ZW), 
Eurasian patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), 
European patent (AT, BE, BG, CH, CY, CZ, DE, DK, EE, 
ES, FI, FR, GB, GR, HU, IE, IT, LU, MC, NL, PT, SE, SI, 
SK, TR), OAPI patent (BF, BJ, CF, CG, CI, CM, GA, GN, 
GQ, GW, ML, MR, NE, SN, TD, TG). 

Published: 

— with international search report 

(88) Date of publication of the international search report: 

16 September 2004 

For two- letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations" appearing at the begin- 
ning of each regular issue of the PCT Gazette. 



< 
so 



(54) Title: COLD SHOCK INDUCIBLE EXPRESSION AND PRODUCTION OF HETEROLOGOUS POLYPEPTIDES 



(57) Abstract: The present invention relates to a DNA molecule or vector and a host cell containing this DNA molecule or vector 
O which can be used to produce a heterologous polypeptide under conditions that elicit a cold shock response in the host cell. The 
DNA molecule and vector include a nucleotide sequence encoding a heterologous polypeptide and a promoter and 5-UTR from a 
cold shock inducible gene which directs its expression. In addition, an AT-rich sequence that enhances translation under cold shock 
inducible conditions is either present in the coding sequence of the heterologous polypeptide or in an additional element inserted 
between the coding sequence and the cold shock inducible promoter and 5'-UTR. 



BNSOOCID: <WO 



_03074657A3_I_> 



Intern ailontU application No. 

INTERNATIONAL SEARCH REPORT 

PCT/US03/0S531 




A. CLASSIFICATION OF SUBJECT MATTER 

IPCC7) s C12N 1/21, 15/63, 5/10; CUP 21/02; G01V 3/14 

USCL : 324/3095 435/69.1,252.3,252.33,320.1,325,419 
a „ T^^rTt^Wnrrnl p-itmr n R^flrai^m /TPO> or lo VjoAi national rlnssificatioii ana iv^ — , , — — 


B. FIELDS SEARCHED _ . — " 

Mtritomn duttimuntaaon searched (classfficaticn system followed by classification symbol*) 
U.S. : 324/309; 435/69. *, 252.3, 252.33, 320J, 325, 419 


Documematioa searched other than mimxnum dccuineBiauoa! to the extent that such oncuinenis are included m the fields parched 


Electronic data base consulted during die international search (name of data base and, where practicable, s< 
Please See Condnuatiozi Sheet 

C. DOCUMENTS CONSIDERED TO BE RELEVANT — — 


sarch termfi used) 


Category * 
A 

A.P 

A 


Citation of document, with indicatioa, vfeere appropriate, of the relevant passages 
US 6 T 294.7U Bl (MEUU2WAETER et al) 25 September 2001 (25.09.2003), see entire 
document. 

US 6,479.260 Bl (TAKAYAMA et al) 12 November 2002 (12. 1 1 .2002), see entire 
document* 

US 6,333 , 191 Bl (INOL) YE et al> 25 December 2001 (25. 12.2001), see entire document. 


Relevant to claim No. 

>-36 
1-36 


1 J Rirtha 


r documents are listed in me wrunmation of Box C. |_] See patent family amies. 


, — . „ . — -T" luick' document pubUtWfid efto* l-C iDianasucnai ming 

• Special coUfiOf Iti of eked documenta: • priurffy do»o aad noi In conflict with ihe appHcaiioo to* =Und to 
-A- docomcui definin R the p^ral of the M «*» b not consi acred to undent the prlndpl* or tb»ry underlying the fa**- 

be of particular relevance document of particular rdevancc* Ac ctuimed invention cannot be 
^ _j - rt.- « ,™^?o^ni nn«<» coDPidofcd novol or cannot be cnoeldcr=d to Involve an inventive 
earfScr application ^ p*=* published on of after *hs uiterparioniu fifing S ,S. document h taken -lone 
date 

doc™** which may throw douh« on priori* - «hi* £ ^ ^ 

to caiapiiih The pubiiualaa dale «T another catwion at other *pcci*l rc-too coScdwiih on* or more otfiw eiich document aueb 
<as ipecificd) combination being obviou* to a pcrxco ikJUed la the an 
-O- document rercmas fa an oral <&sc\ckutc, wc, exhibition or other men* ^ docUmcnl n^bcr of the same patent fumll y 
-P" documeat published prior io ibe imc«»ioooJ filin B date but later tfraaihe , 




Date of the aerial completion of the international search 

2? August 2003 (22.08.2003* 


Date of mailing or the miarnauonai searcn report 

nQ.iUM20M 




N<-me and making address of the ISA/US 
Mail Slap PCT. Ami: ISA/US 
Commissioner for PnleHts 
P.O. Box 1450 

Alexandria. Virginia 22313-1450 

Facsimile No* 


Terry A. McKelvcy / / 
Telephone No. 7Q3-30&-0196 A 





PormPcrr/ISA/2J0 (second shunt) (July 1998) 



BNSDOCID: <WO 03074657A3J_> 



INTERNATIONAL SEARCH REPORT 



PCT/US03/05531 



Continuation of B. FIELDS SEARCHED Item 3: 

EAST (USPAT, PGPUB, EPO, JPO, Derwent databases), Dialog OneSeach (biotech databases) 
search terms: cold shock, indue?, promoter?, regulat?, AT rich, translation, enhanc?, utr, laci 



Form PCT/ISA/210 (second sheet) (July 1998) 



BNSDOCID: <WO 03074657A3_I_> 



V 



THIS PAGE BIAUK (uspto) 



